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(54) Title: BROTHER OF THE REGULATOR OF IMPRIN^FED SITES (BORIS) 

(57) Abstract: An isolated or purified nucleic acid molecule consisting essentially of a nucleotide sequence encoding a human or 
a non-human BORIS, or a fragment of either of the foregoing; an isolated or purified nucleic acid molecule consisting essentially 
of a nucleotide sequence that is complementary to a nucleotide sequence encoding a human or a non-human BORIS, or a fragment 
of eifher of the foregoing; a vector comprising such an isolated or purified nucleic acid molecule; a cell comprising such a vector; 
an isolated or purified polypeptide molecule consisting essentially of an amino acid sequence encoding a human or a non-human 
BORIS, or a fragment of either of the foregoing; a cell line that produces a monoclonal antibody that is specific for an aforementioned 
isolated or purified polypeptide molecule; and the monoclonal antibody produced by the cell line; methods of diagnosing a cancer or 
a predisposiuon lo a cancer in a male or female mammal; a method of prognosticating a cancer in a mammal; a method of assessing 
the ciTeciiveness of treatment of a cancer in a mammal; a method of treating a mammal prophylaclically or therapeutically for a 
cancer, and a composition comprising a carrier and an inhibitor of BORIS. 
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BROTHER OF THE REGULATOR OF IMPRINTED SITES (BORIS) 



FIELD OF THE INVENTION 
[0001] This invention pertains to the cancer-testis gene family member BORIS and its 
use in the diagnosis^ prognosis and treatment of cancer, 

BACKGROUND OF THE INVENTION 
(0002] The American Cancer Society estimates the Hfetime risk that an individual will 
develop cancer is 1 in 2 for men and 1 in 3 for women. The development of cancer, while 
still not completely understood, can be enhanced as a result of a variety of risk factors. For 
example, exposure to environmental factors (e.g., tobacco smoke) might trigger 
modifications in certain genes, thereby initiating cancer development. Alternatively, these 
genetic modifications may not require an exposure to environmental factors to become 
abnormal. Indeed, certain mutations (e.g., insertions, deletions, substitutions; etc.) or 
abnormally imprinted genes can be inherited from generation to generation, thereby 
imparting an individual with a genetic predisposition to develop cancer. 
[0003] Currently, the survival rates for many cancers are on the rise. One reason for 
this success is improvement in the detection of cancer at a stage at which treatment can be 
effective. Indeed, it has been noted that one of the most effective means to survive cancer is 
to detect its presence as early as possible. According to the American Cancer Society, the 
relative survival rate for many cancers would increase by about 15% if individuals 
participated in regular cancer screenings. Therefore, it is becoming increasingly usefiil to 
develop novel diagnostic tools to detect the cancer either before it develops or at an as early 
stage of development as possible. 

[0004] One popular way of detecting cancer early is to analyze the genetic makeup of 
an individual to detect the presence of or to measure expression levels of a marker gene(s) 
related to the cancer. For example, there are various diagnostic methods that analyze a 
certain gene or a pattern of genes to detect cancers of the breast, tongue, mouth, colon, 
rectum, cervix, prostate, testis, and skin. Recently, analyzing the activity of certain DNA- 
binding proteins, such as the CCCTC-binding factor (CTCF), has been found to be useful in 
diagnosing a cancer or a predisposition to a cancer (see, e.g., U.S. Patent No. 5,972,643). 
CTCF and similar DNA*binding proteins can act as transcription factors which regulate 
gene expression, including genes involved in cell proliferation. Normally, CTCF inhibits 
cell proliferation; however, a partial loss of CTCF functions caused by abnormal 
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methylation of certain CTCF target sites, or by zinc finger mutations, has been shown to be 
associated with cancer. 

[0005] Recent efforts have brought together the fields of genomic imprinting, DNA 
methylation, gene regulation through transcriptional insulators, and cancer. Genomic 
imprinting occurs in mammals before or during gamete formation. Certain genes are 
uniquely imprinted in each of a male and female parent; however, only one of these genes 
from either the maternal or paternal chromosome is expressed in their offspring; the other of 
which remains silent. The inheritance of imprinted genes is epigenetic, meaning these 
genes are regulated the same in the offspring as in the parent from which they derived, even 
if the nucleotide sequence encoding the gene(s) is not identical to the parental form (e.g., 
has accrued one or more mutations). As a result of this phenomenon, specific genes either 
are expressed or remain silent, based on their imprint. 

[0006] While the molecular mechanism of imprinting is largely unknown, it appears 
that regions of chromosomes, rather than specific genes, are imprinted. Additionally, it has 
been determined that DNA methylation may play a role in this process. In vertebrates, 
methyl groups can be added to the carbon atom at position 5 in cytosine. These methyl 
groups are typically added when the dinucleotide CpG or groups of CpG (i.e., CpG islands) 
are present along a DNA sequence. CpG islands have primarily been observed in the 5* 
area of expressed genes, and, in particular, the 5' area of certain housekeeping genes (see. 
Bird et al.. Nature 321:209-213 (1986)). It has been hypothesized that DNA methylation 
plays a role in gene regulation by increasing or decreasing the affinity of regulatory DNA- 
binding proteins, such as CTCF (see, Watson et al., Molecular Biology of the Gene, Volume 
//: 3'"* Ed., The Benjamin/Cummings Publishing Company, Inc., Menlo, CA (1987)). 
[0007] The process of imprinting and DNA methylation can be understood by analyzing 
a conunonly studied imprinted gene cluster that is regulated by CTCF, which includes the 
closely linked imprinted genes HI 9 and Igf2. These genes are oppositely imprinted on each 
parental chromosome. Indeed, HI 9 is active on the maternal chromosome with Igf2 
remaining silent, while on the paternal chromosome, IgfZ is active and HI 9 is silent. The 
two genes share an enhancer region located downstream of HI 9. Some studies have shown 
that the imprinting control region (ICR) of HI 9 is a boundary element controlled by DNA 
methylation. For example, it is thought that the CTCF protein binds to the unmethylated 
maternal ICR, which prevents the promoters located in the Ig£2 gene from interacting with 
the enhancers downstream of the HI 9 gene. This results in transcriptional silencing of Igf2. 
If the paternal ICR is present and methylated, CTCF is prevented from binding. This allows 
the enhancers to contact the promoters of the paternal Igf2, allowing the gene to be 
transcribed. 
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[0008] Recent studies have indicated that abnormal imprinting could result in the 
activation of certain growth factors or the inactivation of tumor suppressor genes, both of 
which could result in the formation of cancer. Indeed, various epigenetic alterations have 
been associated with cancers, including global hypomethylation, hypomethylation of 
individual genes, and hypermethylation of CpG islands (see, Feinberg, PNAS, 98(2):392- 
394 (2001)). Thus, it would be beneficial to identify genes which, when abnormally 
imprinted, lead to the development of cancer, 

[0009] Accordingly, a need remains for the identification of genes and gene products 
which can be shown to have a strong association with cancer. Such genes and gene 
products can lead to the development of novel therapeutic applications, as well as to early, 
sensitive and accurate methods for detecting a cancer or a predisposition to a cancer in a 
mammal. Moreover, such methods would enable cUnicians to monitor the response of a 
mammal to a particular treatment with greater sensitivity and accuracy. The present 
invention provides such therapeutic applications and methods. These and other objects and 
advantages of the invention, as well as additional inventive features, will be apparent from 
the description of the invention provided herein. 



[0010] The present invention provides an isolated or purified nucleic acid molecule 
consisting essentially of a nucleotide sequence encoding hxmian BORIS or a fragment 
thereof comprising at least 1536 contiguous nucleotides, as well as an isolated or purified 
nucleic acid molecule consisting essentially of a nucleotide sequence encoding a non-human 
BORIS or a fragment thereof comprising at least 229 contiguous nucleotides and related 
vectors and cells comprising such vectors. 

[001 1] The invention also provides an isolated or purified polypeptide molecule 
consisting essentially of an amino acid sequence encoding human BORIS or a fragment 
thereof comprising at least 307 contiguous amino acids, as well as an isolated or purified 
polypeptide molecule consisting essentially of an amino acid sequence encoding a non- 
human BORIS or a fragment thereof comprising at least 21 contiguous amino acids, and 
related monoclonal antibody-producing cell lines and the monoclonal antibodies so 
produced. The amino acid sequences encoding human or non-human BORIS or fragments 
thereof can optionally be glycosylated, amidated, carboxylated, phosphorylated, esterified, 
N-acylated or converted into an acid addition salt and/or optionally dimerized or 
polymerized. 

[0012] Further provided is an isolated or purified nucleic acid molecule consisting 
essentially of a nucleotide sequence that is complementary to a nucleotide sequence 
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encoding human BORIS or a fragment thereof comprising at least 1536 contiguous 
nucleotides, as well as an isolated or purified nucleic acid molecule consisting essentially of 
a nucleotide sequence that is complementary to a nucleotide sequence encoding a non- 
human BORIS or a fragment thereof comprising at least 229 contiguous nucleotides and 
related vectors and host cells comprising such vectors. 

[00131 Still further provided by the invention is a method of diagnosing a cancer or a 
predisposition to a cancer in a male manmial. The method comprises detecting a nucleic 
acid molecule comprising a nucleotide sequence encoding BORIS or a polypeptide 
molecule comprising an amino acid sequence encoding BORIS in a test sample comprising 
somatic cells obtained from the male manunal. The detection of the nucleic acid or 
polypeptide molecule encoding BORIS in the test sample is indicative of the cancer or a 
predisposition to the cancer in the male mammal. 

[0014] The invention also provides a method of predicting a predisposition to a cancer 
in an offspring of a male manmial. The method comprises detecting either a mutation in a 
nucleic acid molecule comprising a nucleotide sequence encoding BORIS, a decreased level 
of a polypeptide molecule comprising an amino acid sequence encoding wild- type BORIS, 
or a mutation in a polypeptide molecule comprising an amino acid sequence encoding 
BORIS in a test sample comprising germ cells obtained from the male mammal. The 
detection of a mutation in the nucleic acid or polypeptide molecule encoding BORIS or a 
decreased level of wild-type BORIS is indicative of a predisposition to the cancer in the 
offspring of the male mammal. 

[00151 hi addition to a method of diagnosing a cancer or a predisposition to a cancer in 
a male manmial, the invention provides a method of diagnosing a cancer or a predisposition 
to a cancer in a female manm:ial. The method comprises detecting either of a nucleic acid 
molecule comprising a nucleotide sequence encoding BORIS or a polypeptide molecule 
comprising an amino acid sequence encoding BORIS in a test sample obtained from the 
female mammal. The detection of the nucleic acid or polypeptide molecule encoding 
BORIS in the test sample is indicative of the cancer or a predisposition ^to the cancer in the 
female manmial. 

[001 6] The invention further provides a method of prognosticating a cancer in a 
mammal and a method of assessing the effectiveness of treatment of a cancer in a manmial. 
In such methods, BORIS is a marker for the cancer. These methods comprise measuring the 
level of BORIS in a test sample comprising somatic cells obtained from the mammal. The 
level of BORIS in the test sample is indicative of the prognosis or the effectiveness of 
treatment of the cancer in the manmial wherein a decrease or no change in the level of 
BORIS over time is indicative of a positive prognosis or an effective treatment regimen, 
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and, alternatively, an increase in the level of BORIS over time is indicative of a negative 
prognosis or an ineffective treatment regimen. 

[0017] Still further provided by the invention is a method of treating prophylactically or 
therapeutically a mammal for a cancer. In such a method, the cancer is due to the presence 
of a nucleic acid molecule comprising a nucleotide sequence encoding BORIS or a 
polypeptide molecule comprising an amino acid sequence encoding BORIS. The method 
comprises providing an inhibitor of BORIS to the mammal in an amount sufficient to 
prophylactically or therapeutically treat the mammal for the cancer. In this regard, the 
present invention also provides a composition comprising an inhibitor of BORIS and a 
carrier. 

BRIEF DESCRIPTION OF THE DRAWINGS 
[0018] Fig. 1 A represents the nucleotide sequence coiresponding to human BORIS, 
SEQIDNOrl. 

[00191 Fig- IB represents the nucleotide sequence corresponding to murine BORIS, 

SEQ ID NO:3. 

[0020] Fig. 2A represents the amino acid sequence corresponding to human BORIS, 
SEQ ID NO:2. 

[0021] Fig. 2B represents the amino acid sequence corresponding to murine BORIS, 
SEQ ID N0;4. 

[0022] Fig. 3 A illustrates the human BORIS cDNA sequence and its conceptual 
translation with the 1 1 zinc finger regions being double-underlined and indicated as ZF 1- 
11. 

[0023] Fig. 3B illustrates the best-fit alignment of the human CTCF and BORIS 
polypeptides produced by the GCG-package of programs with zero-penahy for the gap 
extension with conserved zing finger regions highlighted and indicated as ZF 1-1 1. 
[0024] Fig. 3C illustrates the best-fit alignment of the human and mxuine BORIS 
polypeptides produced by the GCG-package of programs with zero-penalty for the gap 
extension with conserved zinc finger regions highlighted and indicated as ZF l-U. 
[0025] Fig. 3D illustrates the murine BORIS partial cDNA sequence and its conceptual 
translation with the 1 1 zinc finger regions being double-underlined and indicated as ZF- 1- 
IL 

[0026] Fig. 4A illustrates pairs of primers corresponding to conserved CTCF cDNA 
sequences in vertebrates used to identify human BORIS. 

[0027] Fig. 4B illustrates pairs of primers corresponding to the sequence homology of 
human BORIS and murine CTCF used to identify murine BORIS. 
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DETAILED DESCRIPTION OF THE INVENTION 
[0028] The invention provides an isolated or purified nucleic acid molecule consisting 
essentially of a nucleotide sequence encoding human BORIS or a fragment thereof 
comprising at least 1536 contiguous nucleotides. Preferably, the isolated or purified nucleic 
acid molecule (i) encodes the amino acid sequence of SEQ ID N0:2 or a fragment thereof 
comprising at least 307 contiguous amino acids, (ii) consists essentially of the nucleotide 
sequence of SEQ ID N0:1 or a fragment thereof comprising at least 1536 contiguous 
nucleotides, (iii) hybridizes under highly stringent conditions to an isolated of purified 
nucleic acid molecule consisting essentially of the nucleotide sequence that is 
complementary to SEQ ED NO:l or a fragment thereof, or (iv) shares 45% or more identity 
with SEQ ID NO: 1, 

[0029] While the isolated or purified nucleic acid molecule of the invention consists 
essentially of a nucleotide sequence encoding human BORIS or a fragment thereof 
comprising at least 1536 contiguous nucleotides, larger fragments of human BORIS are also 
contemplated. For example, it is suitable for the isolated or purified nucleic acid molecule 
of the invention to consist essentially of a nucleotide sequence encoding human BORIS or a 
fragment thereof comprising at least 1550 contiguous nucleotides, at least 1560 contiguous 
nucleotides, at least 1570 contiguous nucleotides, at least 1580 contiguous nucleotides, at 
least 1590 contiguous nucleotides, or even at least 1600 contiguous nucleotides. Still larger 
fragments of human BORIS are also contemplated, such as fragments comprising at least 
1700 contiguous nucleotides, at least 1800 contiguous nucleotides, at least 1900 contiguous 
nucleotides, or even at least 2000 contiguous nucleotides. Generally, any size fragment is 
contemplated as long as the fragment comprises contiguous nucleotides spanning 45% or 
more, 50% or more, or even 55% or more of the nucleic acid molecule consisting essentially 
of a nucleotide sequence encoding human BORIS. 

[0030] The invention also provides an isolated or purified polypeptide molecule 
consisting essentially of an amino acid sequence encoding human BORIS or a fragment 
thereof comprising at least 307 contiguous amino acids, either one of which is optionally 
glycosylated, amidated, carboxylated, phosphorylated, esterified, N-acylated or converted 
into an acid addition salt and/or optionally dimerized or polymerized. Preferably, the 
isolated or purified polypeptide molecule (i) is encoded by the nucleotide sequence of SEQ 
ID NO:l or a fragment thereof comprising at least 921 contiguous nucleotides, (ii) consists 
essentially of the amino acid sequence of SEQ ID N0:2 or a fragment thereof comprising at 
least 307 contiguous amino acids, or (iii) shares 47% or more identity with SEQ ID NO:2. 
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[0031] While the isolated or purified polypeptide molecule on the invention consists 
essentially of an amino acid sequence encoding human BORIS or a fragment thereof 
comprising at least 307 contiguous amino acids, larger fragments of human BORIS are also 
contemplated. For example, it is suitable for the isolated or purified polypeptide molecule 
of the invention to consist essentially of an amino acid sequence encoding human BORIS or 
a fragment thereof comprising at least 310 contiguous amino acids, at least 320 contiguous 
amino acids, at least 330 contiguous amino acids, at least 340 contiguous amino acids, or 
even at least 350 contiguous amino acids. Still larger fragments of human BORIS are also 
contemplated, such as fragments comprising at least 400 contiguous amino acids, at least 
450 contiguous amino acids, at least 500 contiguous amino acids, or even at least 550 
contiguous amino acids. Generally, any size fragment is contemplated as long as the 
fragment comprises contiguous amino acids spanning 47% or more, 50% or more, or even 
55% or more of the polypeptide molecule consisting essentially of an amino acid sequence 
encoding human BORIS. 

[0032] Also provided by the invention is a nucleic acid molecule consisting essentially 
of a nucleotide sequence that is complementary to a nucleotide sequence encoding human 
BORIS or a fragment thereof comprising at least 1536 contiguous nucleotides. Preferably, 
such an isolated or purified nucleic acid molecule (i) is complementary^o a nucleotide 
sequence encoding the amino acid sequence of SEQ ID NO:2, (ii) is. complementary to the 
nucleotide sequence of SEQ ED NO:l or a fragment thereof comprising at least 1536 
contiguous nucleotides, (iii) hybridizes under highly stringent conditions to an isolated or 
purified nucleic acid molecule consisting essentially of SEQ ID NO:l or a fragment thereof, 
or (iv) shares 45% or more identity with the nucleotide sequence that is complementary to 
SEQIDNO:!. - . 

[0033] Other forms of BORIS are also contemplated in the invention. In that respect, 
the invention provides an isolated or purified nucleic acid molecule consisting essentially of 
a nucleotide sequence encoding a non-human BORIS or a fragment thereof comprising at 
least 229 contiguous nucleotides. Preferably, the isolated or purified nucleic acid molecule 
(i) encodes the amino acid sequence of SEQ ID N0:4 or a fragment thereof comprising at 
least 21 contiguous amino acids, (ii) consists essentially of the nucleotide sequence of SEQ 
ID NO:3 or a fragment thereof comprising at least 229 contiguous nucleotides, (iii) 
hybridizes under moderately stringent conditions to an isolated or purified nucleic acid 
molecule consisting essentially of the nucleotide sequence that is complementary to SEQ ID 
NO:3 or a fragment thereof, or (iv) shares 23% or more identity with SEQ ED NO:l. 
[0034] While the isolated or purified nucleic acid molecule of the invention consists 
essentially of a nucleotide sequence encoding a non-human BORIS or a fiugment thereof 
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comprising at least 229 contiguous nucleotides, larger fragments of human BORIS are also 
contemplated. For example, it is suitable for the isolated or purified nucleic acid molecule 
of the invention to consist essentially of a nucleotide sequence encoding a non-human 
BORIS or a fragment thereof comprising at least 235 contiguous nucleotides, at least 250 
contiguous nucleotides, at least 260 contiguous nucleotides, at least 270 contiguous 
nucleotides, at least 280 contiguous nucleotides, or even at least 290 contiguous nucleotides. 
Still larger fragments of a non-human BORIS are also contemplated, such as fragments 
comprising at least 300 contiguous nucleotides, at least 400 contiguous nucleotides, at least 
500 contiguous nucleotides, or even at least 600 contiguous nucleotides. Generally, any 
size fragment is contemplated as long as the fragment comprises contiguous nucleotides 
spanning 10% or more, 20% or more, or even 30% or more of the nucleic acid molecule 
consisting essentially of a nucleotide sequence encoding a non-human BORIS. 
[0035] The invention also provides an isolated or purified polypeptide molecule 
consisting essentially of an amino acid sequence encoding a non-human BORIS or a 
fragment thereof comprising at least 21 contiguous amino acids, either one of which is 
optionally glycosylated, amidated, carboxylated, phosphorylated, esterified, N-acylated or 
converted into an acid addition salt and/or optionally dimerized or polymerized. Preferably, 
the isolated or purified polypeptide molecule (i) is encoded by the nucleotide sequence of 
SEQ ID NO:3 or a fragment thereof comprising at least 63 contiguous nucleotides, (ii) 
consists essentially of the amino acid sequence of SEQ ID N0:4 or a fragment thereof 
comprising at least 21 contiguous amino acids, or (iii) shares 40% or more identity with 
SEQ ID NO:4 

[0036] While the isolated or purified polypeptide molecule on the invention consists 
essentially of an amino acid sequence encoding a non-human BORIS or a fragment thereof 
comprising at least 21 contiguous amino acids, larger fragments of a non-human BORIS are 
also contemplated. For example, it is suitable for the isolated or purified polypeptide 
molecule of the invention to consist essentially of an amino acid sequence encoding a non- 
human BORIS or a fragment thereof comprising at least 25 contiguous amino acids, at least 
30 contiguous amino acids, at least 35 contiguous amino acids, at least 40 contiguous amino 
acids, or even at least 45 contiguous amino acids. Still larger fragments of a non-human 
BORIS are also contemplated, such as fragments comprising at least 50 contiguous amino 
acids, at least 55 contiguous amino acids, at least 60 contiguous amino acids, or even at 
least 65 contiguous amino acids. Generally, any size fragment is contemplated as long as 
the fragment comprises contiguous amino acids spanning 5% or more, 10% or more, or 
even 15% or more of the polypeptide molecule consisting essentially of an amino acid 
sequence encoding a non-human BORIS. 
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[0037] Also provided by the invention is a nucleic acid molecule consisting essentially 
of a nucleotide sequence that is complementary to a nucleotide sequence encoding a non- 
human BORIS or a fragment thereof comprising at least 229 contiguous nucleotides. 
Preferably, such an isolated or purified nucleic acid molecule (i) is complementary to a 
nucleotide sequence encoding the amino acid sequence of SEQ ID N0:4, (ii) is 
complementary to the nucleotide sequence of SEQ ID NO:3 or a fragment thereof 
comprising at least 229 contiguous nucleotides, (iii) hybridizes under moderately stringent 
conditions to an isolated or purified nucleic acid molecule consisting essentially of SEQ ID 
NO:3 or a fragment thereof, or (iv) shares 23% or more identity with the nucleotide 
sequence that is complementary to SEQ ED N0:3. 

[0038] It will be understood that a non-human BORIS can represent any organism other 
than human. Typically, the organism is a mammal, such as a rat or mouse. 
[0039] By "isolated" is meant the removal of a nucleic acid or polypeptide molecule 
from its natiu'al environment. By "purified" is meant that a given nucleic acid or 
polypeptide molecule, whether one that has been removed from nature or synthesized and/or 
amplified under laboratory conditions, has been increased in purity, wherein "purity" is a 
relative term, not "absolute purity." A "nucleic acid molecule" is intended to encompass a 
polymer of DNA or RNA, (i.e., a polynucleotide), which can be single-stranded or double- 
stranded and which can contain non-natural or altered nucleotides. Similarly, a 
"polypeptide molecule" is intended to encompass a linear sequence of amino acids (i.e., a 
primary protein stmcture) but also can include secondary, tertiary, and quaternary protein 
structiures, all of which can contain non-natural or altered amino acids. 
[0040] With respect to the above isolated or purified nucleic acid molecules, it is 
preferred that no insertions, deletions, inversions and/or substitutions are present in the 
nucleic acid molecule. Such a nucleic acid molecule will code for a "wild-type*' BORIS. 
However, it is suitable for the above isolated or purified nucleic acid molecules to comprise 
one or more insertions, deletions, inversions and/or substitutions. Such a nucleic acid 
molecule will code for a ^Variant BORIS". In this respect, the invention provides an. 
isolated or purified nucleic acid molecule consisting essentially of a nucleotide sequence 
encoding a variant human BORIS or a fragment thereof comprising at least 1536 contiguous 
nucleotides. The invention also provides an isolated or purified nucleic acid molecule 
consisting essentially of a nucleotide sequence encoding a variant non-human BORIS or a 
fragment thereof comprising at least 229 contiguous nucleotides. 

[0041] Similarly, with respect to the above isolated or purified polypeptide molecules, it 
is preferred that no insertions, deletions, substitutions and/or abnormal post-translational 
modifications are present in the polypeptide molecule. Such a polypeptide molecule will 
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code for a wild-type BORIS. However, it is suitable for the above isolated or purified 
polypeptide molecules to comprise one or more insertions, deletions, substitutions and/or 
abnormal post-translational modifications. Such a polypeptide molecule will code for a 
variant BORIS. In this respect, the invention provides an isolated or purified polypeptide 
molecule consisting essentially of an amino acid sequence encoding a variant human 
BORIS or a fragment thereof comprising at least 307 contiguous amino acids. The 
invention also provides an isolated or purified polypeptide molecule consisting essentially 
of an amino acid sequence encoding a variant non-human BORIS or a fragment thereof 
comprising at least 21 contiguous amino acids. 

[0042] Preferably, the variant BORIS (human or non-human) will not differ functionally 
from the corresponding wild-type BORIS. For example, any insertions, deletions, 
inversions and/or substitutions contained within the nucleic acid molecule comprising a 
nucleotide sequence encoding the variant BORIS will not (i) result in the introduction of a 
frame-shift mutation, (2) interfere with the ability of the promoter region to direct the 
transcription of the nucleotide sequence, or (3) interfere with the ability of the 
corresponding RNA transcript to be translated into a protein. It is also preferred that the one 
or more substitution(s) do(es) not result in a change in an amino acid of BORIS. 
Alternatively, and also preferred, is that the one or more substitution(s) result(s) in the 
substitution of an amino acid with another amino acid of approximately equivalent size, 
shape and charge. 

[0043] If desired, the polypeptide molecules of the invention (including variant 
polypeptide molecules) can be modified, for instance, by glycosylation, amidation, 
carboxylation, or phosphorylation, or by the creation of acid addition salts, amides, esters, in 
particular C-terminal esters, and N-acyl derivatives of the polypeptide molecules of the 
invention. The polypeptide molecules also can be dimerized or polymerized. Moreover, the 
polypeptide molecules can be modified to create polypeptide derivatives by forming covalent 
or non-covalent complexes with other moieties in accordance with methods known in the art. 
Covalently-bound complexes can be prepared by linking the chemical moieties to functional 
groups on the side chains of amino acids comprising the polypeptides, or at the N- or C- 
terminus. 

[0044] Also with respect to the above, '*will not differ functionally from" is intended to 
mean that the variant BORIS will have activity characteristic of the wild-type BORIS. 
However, the variant BORIS can be more or less active than the wild-type BORIS as 
desired in accord£Lnce with the present invention. 

[0045] The phrase "hybridizes to" refers to the selective binding of a single-stranded 
nucleic acid probe to a single-stranded target DNA or RNA sequence of complementary 
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sequence when the target sequence is present in a preparation of heterogeneous DNA and/or 
RNA. "Stringent conditions** are sequence-dependent and will be different in different 
circumstances. Generally, stringent conditions are selected to be about 20 lower than the 
thermal melting point (Tm) for the specific sequence at a defined ionic strength and pH. 
The Tm is the temperature (imder defined ionic strength and pH) at which 50% of the target 
sequence hybridizes to a perfectly matched probe. 

[0046] For example* under stringent conditions, as that term is understood by one skilled 
in the art, hybridization is preferably carried out using a standard hybridization buffer at a 
temperature ranging from about 50 to about 75 °C, even more preferably from about 60 °C 
to about 70 **C, and optimally from about 65 "^C to about 68 **C. Altemately, formamide can be 
included in the hybridization reaction, and the temperature of hybridization can be reduced to 
preferably from about 35 **C to about 45 **C, even more preferably from about 40 ^'C to about 
45 **C, and optimally to about 42 **C. Desirably, formamide is included in the hybridization 
reaction at a concentration of from about 30% to about 50%, preferably from about 35% to 
about 45%, and optimally at about 40%. Moreover, optionally, the hybridized sequences are 
washed (if necessary to reduce non-specific binding) under relatively highly stringent 
conditions, as that term is understood by those skilled in the art. For instance, desirably, the 
hybridized sequences are washed one or more times using a solution comprising salt and 
detergent, preferably at a temperature of from about 50 **C to about 75 **C, even more 
preferably at from about 60 **C to about 70 **C, and optimally from about 65 ^^C to about 68 **C. 
Preferably, a salt (e.g., such as sodium chloride) is included in the wash solution at a 
concentration of from about 0.01 M to about 1.0 M. Optimally, a detergent (e.g., such as 
sodium dodecyl sulfate) is also included at a concentration of from about.-0.01% to about 1 .0%. 
[0047] In view of the above, "highly stringent conditions*' preferably allow for from 
about 25% to about 5% mismatch, more preferably from about 15% to about 5% mismatch, 
and most preferably from about 10% to about 5% mismatch. "Moderately stringent 
conditions" preferably allow for from about 40% to about 15% mismatch, more preferably 
from about 30% to about 15% mismatch, and most preferably from about 20% to about 
15% mismatch. "Low stringent conditions'* preferably allow for from about 60% to about 
35% mismatch, more preferably from about 50% to about 35% mismatch, and most 
preferably from about 40% to about 35% mismatch. With respect to the preceding ranges 
of mismatch, 1% mismatch corresponds to one degree decrease in the melting temperature. 
It is generally appreciated that the stringent conditions can be manipulated by adjusting the 
concentration of formamide in the hybridization reaction. For example, conditions can be 
rendered more stringent by the addition of increasing amounts of form^iide. 
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[0048] The above isolated or purified nucleic acid and polypeptide molecules also can . 
be characterized in terms of "percentage of sequence identity." In this regard, a given 
nucleic acid or polypeptide molecule as described above can be compared to a nucleic acid 
or polypeptide molecule encoding BORIS by optimally aligning the nucleotide or amino 
acid sequences over a comparison window, wherein the portion of the nucleotide or amino 
acid sequence in the comparison window may comprise additions or deletions (i.e., gaps) as 
compared to the reference sequence, which does not comprise additions or deletions, for 
optimal alignment of the two sequences. The percentage of sequence identity is calculated 
by determining the number of positions at which the identical nucleotide or amino acid 
occurs in both sequences, i.e., the number of matched positions, dividing the number of 
matched positions by the total number of positions in the window of comparison, and 
multiplying the result by 100 to yield the percentage of sequence identity. Optimal 
aligrunent of sequences for comparison may be conducted by computerized 
implementations of known algorithms (e.g., GAP, BESTFIT, FASTA, and TFASTA in the 
Wisconsin Genetics Software Package, Genetics Computer Group (GCG), 575 Science Dr., 
Madison, WI; BlastN and BlastP available from the National Center for Biotechnology 
Information, Bethesda, MD; or ClustalW available from the European Bioinformatics 
Institute, Cambridgeshire, UK), or by inspection. Generally, in regards to. human BORIS, 
the isolated or purified nucleic acid molecule consists essentially of a nucleotide sequence 
encoding human BORIS which shares 45% or more identity with SEQ ID N0:1, and the 
isolated or purified polypeptide molecule consists essentially of an amino acid sequence 
encoding human BORIS which shares 47% or more identity with SEQ ID NO:2. Similarly, 
in regards to a non-human BORIS, the isolated or purified nucleic acid molecule consists 
essentially of an nucleotide sequence encoding a non-human BORIS which shares 47% or 
more identify with SEQ ID NO:3, and the isolated or purified polypeptide molecule consists 
essentially of an amino acid sequence encoding a non-human BORIS which shares 40% or 
more identify with SEQ ID N0:4. It will be understood, however, that the percentage of 
sequence identity may vary slightly when using the different computerized programs since 
these programs implement different algorithms. The invention is intended to cover such 
variations but will generally share the percentage of sequence identities above using at least 
one computerized program and its respective algorithm. 

[00491 The present invention also provides a vector comprising an above-described 
isolated or purified nucleic acid molecule. A nucleic acid molecule as described above can 
be cloned into any suitable vector and can be used to transform or transfect any suitable 
host. The selection of vectors and methods to construct them are commonly known to 
persons of ordinary skill in the art and are described in general technical references (see, in 
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general, "Recombinant DNA Part D," Methods in Enzymology, Vol. 153, Wu and 
Grossman, eds.. Academic Press (1987)). 

10050) Suitable vectors include those designed for propagation and expansion or for 
expression or both. Examples of suitable vectors include plasmids, phagemids, cosmids, 
viruses, and other vehicles derived from viral or bacterial sources. Preferably, the vector is 
a viral vector and is selected from the group consisting of an adenovirus, adeno-associated 
virus, retroviruses, SV40-type viruses, polyoma viruses, Epstein Barr viruses, 
papillomaviruses, herpes virus, vaccinia virus and polio virus. Most preferably, the vector 
is an adenoviral vector. 

[0051] When an adenoviral vector is used in the context of the present invention, the 
adenoviral vector can be derived from any serotype of adenovirus. Adenoviral stocks that 
can be employed as a source of adenovirus can be amplified from the adenoviral serotypes 1 
through 51, which are currently available from the American Type Culture Collection 
(ATCC, Manassas, VA), or fix)m any other serotype of adenovirus available from any other 
source. For instance, an adenovirus can be of subgroup A (e.g., serotypes 12, 18, and 31), 
subgroup B (e.g., serotypes 3,7, 11, 14, 16, 21, 34, and 35), subgroup C (e.g., serotypes 1, 
2, 5, and 6), subgroup D (e.g., serotypes 8, 9, 10, 13, 15, 17, 19, 20, 22-30, 32, 33, 36-39, 
and 42-47), subgroup E (serotype 4), subgroup F (serotypes 40 and 41), or any other 
adenoviral serotype. Preferably, however, an adenovirus is of serotype 2,. 5 or 9. However, 
non-group C adenoviruses can be used to prepare adenoviral vectors for delivery of one or 
more non-native nucleic acid sequences to a desired tissue. Preferred adenoviruses used in 
the construction of non-group C adenoviral vectors include Ad 12 (group A), Ad7 (group B), 
Ad30 and Ad36 (group D), Ad4 (group E), and Ad41 (group F). Non-group C adenoviral 
vectors, methods of producing non-group C adenoviral vectors, and niethods of using non- 
group C adenoviral vectors are disclosed in, for example, U.S. Patents 5,801.030; 
5,837,51 1; and 5,849,561 and International Patent Applications WO 97/12986 and WO 
98/53087. 

[0052J In preferred embodiments, the adenoviral vector of the present invention is 
deficient in one or more replication-essential gene functions. Regions contained within the 
adenoviral genome which are essential for replication include Ela, Elb, E2, E4, and L1-L5. 
By "deficient" is meant a disruption contained within at least one of the above-mentioned 
regions such that the gene product encoded by the region is produced in a reduced amount 
as compared to normal levels. Suitable disruptions include point mutations, substitutions, 
deletions, insertions, and inversions. Typically, the adenoviral vector is deficient in one or 
more replication-essential gene functions of the Ela, Elb, E3 and/or E4 region. 
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(0053] A nucleic acid sequence encoding a marker protein, such as green fluorescent . 
protein or luciferase also can be present in the vector. Such marker proteins are useful in 
vector construction and determining vector migration. Marker proteins also can be used to 
determine points of injection in order to efficiently space injections of a vector composition 
to provide a widespread area of treatment, if desired. Alternatively, a nucleic acid sequence 
encoding a selection factor, which also is useful in vector construction protocols, can be part 
of the adenoviral vector. 

[0054] Negative selection genes may be incorporated into any of the above-described 
vectors. A preferred embodiment is an HSV tk gene cassette (Zjilstra et al., Nature, 342: 
435 (1989); Mansour et al.. Nature, 336: 348 (1988); Johnson et al.. Science, 245: 1234 
(1989): Adair et al., PNAS, 86: 4574 (1989); Capecchi, M., Science, 244: 1288 (1989), 
incorporated herein by reference) operably linked to a viral promoter in a viral vector. The 
tk expression cassette (or other negative selection expression cassette) is inserted into the 
viral genome, for example, as a replacement for a substantial deletion of a non-essential 
viral gene. Other negative selection genes will be apparent to those of skill in the art. 
[0055J The vector of the present invention can comprise a native or non-native 
regulatory sequence operably linked to an isolated or purified nucleic acid molecule as 
described above. If more than one nucleotide sequence is included in the nucleic acid 
molecule, each sequence can be operably linked to its own regulatory sequence. The 
"regulatory sequence" is typically a promoter sequence or promoter-enhancer combination, 
which facilitates the efficient transcription and translation of the nucleic acid to which it is 
operably linked. The regulatory sequence can, for example, be a mammalian or viral 
promoter, such as a constitutive or inducible promoter. Exemplary viral promoters which 
function constitutively in eukaryotic cells include, for example, promoters from the simian 
virus, papilloma virus, adenovirus, human immunodeficiency virus, Rous sarcoma virus, 
cytomegalovirus, Moloney leukemia virus and other retroviruses, and Herpes simplex virus. 
Other constitutive promoters are known to those of ordinary skill in the art. The promoters^ 
useful as regulatory sequences of the invention also include inducible promoters. Inducible 
promoters are expressed in the presence of an inducing agent. For example, the 
metallothionein promoter is induced to promote transcription and translation in the presence 
of certain metal ions. Other inducible promoters are known to those of ordinary skill in the 
art and can be used in the context of the invention, when desired. The selection of 
promoters, e.g., strong, weak, inducible, tissue-specific and developmental-specific, is 
within the skill in the art. Similarly, the combining of a nucleic acid molecule as described 
above with a promoter is also within the skill in the art. 
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[0056] The term "operably linked" as used herein can be defined when a nucleic acid 
molecule and the regulatory sequence are covalently linked in such a way as to place the 
expression of the nucleotide coding sequence under the influence or control of the 
regulatory sequence. Thus, a regulatory sequence would be operably linked to a nucleic 
acid molecule if the regulatory sequence were capable of effecting transcription of that 
nucleic acid molecule such that the resulting transcript is translated into the desired protein 
or polypeptide. 

[00571 The present invention further provides a cell (i.e., a host cell) comprising an 
isolated or purified nucleic acid molecule or a vector as described above. Examples of host 
cells include, but are not limited to, a prokaryotic or eurkaryotic host cell. Prokaryotic cells 
include those derived fi-om E. coli. B. subtilis, P, aerugenosa, S. cerevisiae, and,//, crassa. 
Preferably, the host cell is derived from a manimal, such as a human. 
[0058] Cell lines producing monoclonal antibodies also are contemplated in the invention. 
Such "hybridoma cell lines" desirably produce a monoclonal antibody that is specific for 
BORIS. Methods of making hybridomas are known in the art (see, e.g., Roitt I., 
Immunology, Ed., Mosby, NY (1996)). Thus, the present invention also provides a 
monoclonal antibody produced by the hybridoma cell line. Typically, the monoclonal 
antibody will be specific for a region of BORIS or a region of a variant BORIS, wherein:the ^ 
region comprises any region other than one encoding a conserved zinc finger region (e.g., other 
than one spanning amino acids 259-568) of the particular targeted BORIS. Typically, the 
region will be the N- or C- terminal portion of BORIS, which are unique (i.e., not conserved) 
regions in their respective organisms. Alternatively, the antibody can be specific for a zinc 
finger region of BORIS. Such an antibody will have a greater affinity for zinc finger regions 
of BORIS as compared to other proteins containing similar zinc finger regions (e.g., CTCF); 
thus being able to distinguish between the two molecules. Monoclonal antibodies of the 
invention can be employed for both diagnostic and therapeutic applications as they are 
described herein. 

[0059] BORIS is a DNA-binding protein that has been mapped to the cancer-associated 
region 20ql3 within the human genome. It has been shown to contain the same exons 
encoding the 1 1 zinc finger domain as mammalian CTCF genes while being completely 
divergent at the amino and carboxy termini. This indicates that the nucleoprotein 
complexes generated by BORIS and CTCF bind to the same target DNA sites but are likely 
to have distinct fiinctions. BORIS and CTCF are expressed in a mutually exclusive pattern 
that correlates with re-setting of methylation marks during male germ cell differentiation, 
thus suggesting that BORIS directs epigenetic reprogramming at CTCF target sites. Male 
germ cells in which reprogramming of imprinting occurs is positive for BORIS but negative 
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for CTCF, which provides the opportunity for BORIS to set paternally imprinted insulator 
sites that are later read by CTCF. The expression of BORIS in spermatocytes could reflect 
a demethylation of its promoter. Alternatively, BORIS could be associated v/ith 
demethylases that participate in the erasure of methylation marks. It is also possible that 
BORIS activation is intimately linked with initiating de novo methylation. In that respect, it 
is possible that BORIS interacts with the Suv39h2 histone H3 methyltransferase, which 
marks chromatin for de novo methylation and is co-expressed with BORIS. In any event, at 
later stages of spermatogenesis, BORIS is silenced while CTCF is re-activated. 
(00601 It has been determined that BORIS belongs to the "cancer-testis" (CT) gene 
family because it is aberrantly activated in substantial proportions of different cancers. The 
CT gene family combines genes that are normally expressed only in testis but frequently 
activated in different malignancies. Most, but not all, of CT- family genes encode human 
tumor antigens recognized by T cells. These genes include the MAGE, GAGE, and 
LAGE/ESO-1 CT-subgrpups. A few recently discovered CT antigens are nuclear factors. 
However, BORIS is a unique member of the CT gene family because, in contrast to all other 
CT-genes, it has a somatic counterpart, CTCF, that has anti-proliferative properties and 
shares with BORIS homologous ZFs capable of mediating binding to overlapping sets of 
DNA targets. Abnormal function of BORIS due to mutations in the nucleotide sequence 
encoding it, such as the DNA-recognition domain, could result in an abnormal pattern of 
gene imprinting, a phenomenon that is known to be frequently associated with different 
cancers. Moreover, since BORIS has been shown to share the same unique DNA-binding 
sequences as CTCF, abnormal activation of BORIS in somatic cells may compete with the 
normal function of CTCF, leading to uncontrolled cell proliferation. 
[0061] In view of the above, the invention provides a method of diagnosing a cancer or 
a predisposition to a cancer in a male mammal. One such method comprises detecting 
either (i) a nucleic acid molecule comprising a nucleotide sequence encoding BORIS or (ii) 
a polypeptide molecule comprising an amino acid sequence encoding BORIS in a test 
sample comprising somatic cells obtained from the male mammal. The detection of (i) or 
(ii) in the test sample is indicative of the cancer of a predisposition to the cancer in the 
mammal. 

10062] As indicated above, abnormal imprinting has been shown to have a relationship 
with the development of cancer. Accordingly, the invention provides a method of 
predicting a predisposition to a cancer in an offspring of a male mammal comprising 
detecting either (i) a mutation in a nucleic acid molecule comprising a nucleotide sequence 
encoding BORIS, (ii) a decreased level of a polypeptide molecule comprising an amino acid 
sequence encoding wild-type BORIS, or (iii) a mutation in a polypeptide molecule 
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comprising an amino acid sequence encoding BORIS in a test sample comprising germ cells 
obtained fix>m the male mammal. The detection of (i), (ii), or (iii) in the test sample is 
indicative of the cancer or a predisposition to the cancer in the offspring of the male 
mammal. 

[0063] BORIS is generally expressed only in the germ cells of males. Thus, the 
activation of BORIS in any cell type contained in a female mammal is abnormal. 
Accordingly, a female mammal also can be diagnosed with a cancer or a predisposition to a 
cancer utilizing a method of the invention. Such a method comprises detecting either (i) a 
nucleic acid molecule comprising a nucleotide sequence encoding BORIS or (ii) a 
polypeptide molecule comprising an amino acid sequence encoding BORIS in a test sample 
obtained from the female mammal. The detection of (i) or (ii) in the test sample is 
indicative of the cancer or a predisposition to the cancer in the female mammal. 
[0064] The test sample used in conjunction with the invention can be any of those 
typically used in the art and will vary depending on the condition of the mammal (i.e., 
whether or not a cancer has developed in the mammal). For example, the test sample can be 
tissue, which tissue comprises somatic cells. If the test sample is obtained from a male 
mammal, the test sample can be sperm cells or cells giving rise to spenn. Typically, the 
tissue is metastatic (e.g., cancerous) and is obtained by means of a biopsy. Such tissue can 
include bone marrow, lymph nodes, skin, and any organ that may develop;canc€rous cells. 
If the test sample is obtained from a male mammal, the test sample can be taken from the 
testes of the male mammal. Preferably, however, the test sample is one which is least 
invasive to the mammal, such as a blood sample. 

[0065] A number of assays are contemplated for use in analyzing a given test sample of 
the present invention. As used herein, the term "assay" can be defined as any quantitative 
or qualitative analysis of a nucleic acid or polypeptide molecule that is known in the art. A 
variety of these assays are contemplated for use in the invention, many of which are 
described in Sambrook et al.. Molecular Cloning: A Laboratory Manual, 2"** Ed., Cold 
Spring Harbor Press, Cold Spring Harbor, NY, (1989). Microarrays, such as those 
described in U.S. Patent Nos. 6,197,506 and 6,040,138, also can be used to detect and 
quantify BORIS. It will be understood that the type of assay used will depend on whether a 
nucleic acid or polypeptide molecule is being assayed for and whether the detection or 
quantification of the nucleic acid or polypeptide molecule is sought. 
[0066] When a nucleic acid molecule encoding a nucleotide sequence encoding BORIS 
is assayed for, various assays can be used to detect or to measure the level of BORIS in a 
given test sample. For example, when only the detection of BORIS or the identification of a 
mutation in BORIS is necessary to diagnose effectively the cancer or a predisposition to the 
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cancer, assays including PCR and microanray analysis can be used. In certain embodiments 
it may be necessary to detect the quantity of BORIS present. In such instances, it will be 
advantageous to use various hybridization techniques known in the art that can effectively 
measure the level of BORIS in a test sample. When BORIS comprises DNA, such 
hybridization techniques can include, for example, Southern hybridization (i.e., a Southern 
blot), in situ hybridization and microanray analysis. Similarly, when BORIS comprises 
RNA, Northern hybridization (i.e., a Northern blot), in situ hybridization and microanray 
analysis are contemplated. 

{0067] It will be understood that, in such assays, a nucleotide sequence that specifically 
binds to or associates with a nucleic acid molecule comprising a nucleotide sequence 
encoding BORIS, whether DNA or RNA, can be attached to a label for determining 
hybridization. A wide variety of appropriate labels are known in the art, including 
fluorescent, radioactive, and enzymatic labels as well as ligands, such as avidin/biotin, 
which are capable of being detected. Preferably, a fluorescent label or an enzyme tag, such 
as urease, alkaline phosphatase or peroxidase, is used instead of a radioactive or other 
environmentally undesirable label. In the case of enzyme tags, colorimetric indicator 
substrates are known which can be employed to provide a detection means visible to the 
himian eye or spectrophotometrically to identify specific hybridization with complementary 
BORIS nucleic acid-containing samples. ' 
[0068] When a nucleic acid molecule comprising a nucleotide sequence encoding 
BORIS is amplified in the context of a diagnostic application, the nucleic acid used as a 
template for amplification is isolated fi'om cells contained in the test sample, according to 
standard methodologies (see, e.g., Sambrook et al., (1989), supra). The nucleic acid can be 
genomic DNA or fi^ctionated or whole cell RNA. Where RNA is used, it can be desirable 
to convert the RNA to cDNA. 

10069] In a typical amplification procedure, pairs of primers that selectively hybridize to 
nucleic acids corresponding to BORIS are contacted with the nucleic acid under conditions 
that permit selective hybridization. Once hybridized, the nucleic acid-primer complex is 
contacted with one or more enzymes that facilitate template-dependent nucleic acid 
synthesis. Multiple rounds of amplification, also referred to as "cycles," are conducted until 
a sufficient amount of amplification product is produced. 

(0070] Various template-dependent processes are available to amplify BORIS present in 
a given test sample. As with the various assays, a number of these processes are described 
in Sambrook et al. (1989), supra. One of the best-known amplification methods is the 
polymerase chain reaction (PCR). Similarly, a reverse transcriptase PCR (RT-PCR) can be 
used when it is desired to convert mRNA into cDNA. Alternative methods for reverse 
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transcription utilize thermostable DNA polymerases and are described in WO 90/07641, for 
example. 

[0071 J Other methods for amplification include the ligase chain reaction (LCR), which 
is disclosed in U.S. Patent No. 4,883,750; isothermal amplification, in which restriction 
endonucleases and ligases are used to achieve the amplification of target molecules that 
contain nucleotide 5'-[alpha-thio]-triphosphates in one strand (Walker et al., Proc, Natl 
Acad. ScL USA 89: 392-396 (1992)); strand displacement amplification (SDA), which 
involves multiple rounds of strand displacement and synthesis, i.e., nick translation; and 
repair chain reaction (RCR), which involves annealing several probes throughout a region 
targeted for amplification, followed by a repair reaction in which only two of the four bases 
are present. The other two bases can be added as biotinylated derivatives for easy detection. 
Target-specific sequences also can be detected using a cyclic probe reaction (CPR). In 
CPR, a probe having 3' and 5' sequences of non-specific DNA and a middle sequence of 
specific RNA is hybridized to DNA, which is present in a sample. Upon hybridization, the 
reaction is treated with RNase H, and the products of the probe are identified as distinctive 
products, which are released after digestion. The original template is annealed to another 
cycling probe and the reaction is repeated. A number of other amplification processes are 
contemplated; however, the invention is not limited as to which method is used. 
[0072] Following amplification of BORIS, it can be desirable to separate the 
amplification product fi-om the template and the excess primer for the purpose of 
determining whether specific amplification has occurred. In one embodiment, amplification 
products are separated by agarose, agarose-acrylamide or polyacrylamide gel 
electrophoresis using standard methods. See Sambrook et al. (1989), supra. 
[0073] Alternatively, chromatographic techniques can be employed to effect separation. 
There are many kinds of chromatography which can be used in the context of the present 
inventive methods e.g., adsorption, partition, ion-exchange and molecular sieve, and many 
specialized techniques for using them including column, paper, thin-layer and gas 
chromatography (Freifelder, Physical Biochemistry Applications to Biochemistry and 
Molecular Biology, 2^*** Ed., Wm. Freeman and Co., New York. NY (1982)). 
[0074] Amplification products must be visualized in order to confinn amplification of 
the BORIS sequence. One typical visualization method involves staining of a gel with 
ethidium bromide and visualization under UV Ught. Alternatively, if the amplification 
products are integrally labeled with radio- or fluorometrically-labeled nucleotides, the 
amplification products can then be exposed to x-ray film or visualized under the appropriate 
stimulating spectra, following separation. 
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[0075] In one embodiment, visualization is achieved indirectly. Following separation of 
amplification products, a labeled, nucleic acid probe is brought into contact with the 
amplified BORIS sequence. The probe preferably is conjugated to a chromophore but may 
be radiolabeled. In another embodiment, the probe is conjugated to a binding partner, such 
as an antibody or biotin, where the other member of the binding pair carries a detectable 
moiety (i.e., a label). 

[0076] One example of the foregoing is described in U.S. Patent No. 5,279,721 , which 
discloses an apparatus and method for the automated electrophoresis and transfer of nucleic 
acids. The apparatus permits electrophoresis and blotting without external manipulation of 
the gel and is ideally suited to carrying out methods according to the present invention. 
[0077] It will be understood that the probes described above are limited in as much as 
any nucleic acid molecule comprising a nucleotide sequence can be used as long as the 
nucleic acid molecule comprising the nucleotide sequence is hybridizable to nucleic acid 
molecules comprising a nucleotide sequence encoding BORIS or a firagment thereof For 
example, a nucleic acid of partial sequence can be used to quantify the expression of a 
structurally related gene or the full-length genomic or cDNA clone fi-om which it is derived. 
[00781 When a polypeptide molecule comprising an amino acid sequence encoding 
BORIS is assayed, various assays (i.e., immunobinding assays) are contemplated to either 
detect or to measure the level of BORIS in a given test sample. In such embodiments, 
BORIS, or an antibody able to recognize antibodies that are specific for BORIS (i.e., an 
anti-idiotypic antibody), can be employed to detect antibodies having reactivity therewith, 
or, alternatively, antibodies can be prepared and employed to detect BORIS or an anti- 
idiotypic antibody thereof. The steps of various useful immunodetection assays have been 
described in Nakamura et al.. Handbook of Experimental Immunology (4* Ed.), Wol. I, 
Chapter 27, Blackwell Scientific Publ., Oxford (1987); Nakamura et al.. Enzyme 
Immunoassays: Heterogenous and Homogenous Systems, Chapter 27 (1987) and include 
Western hybridization (i.e., Westem blots), immunoaffinity purification, immunoaffinity 
detection, enzyme-linked immunosorbent assay (e.g., an ELISA), and radioimmunoassay. 
A microarray also can be used to detect or measure the levels of BORIS. 
[0079] • In general, the immunobinding assays involve obtaining a test sample suspected 
of containing a polypeptide molecule comprising an amino acid sequence encoding BORIS, 
and contacting the test sample with an antibody in accordance with the present invention, as 
the case may be, under conditions effective to allow the formation of immunocomplexes. 
Indeed, a mammal can be diagnosed with a cancer or a predisposition to a cancer by either 
detecting or quantifying the levels of a polypeptide molecule comprising an amino acid 
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sequence encoding BORIS, or an antibody that recognizes an antibody that is specific for 



[0080] Any suitable antibody can be used in conjunction with the present invention. 
Typically, the antibody is specific for BORIS, however, the antibody can recognize other 
antibodies (i.e., an anti-idiotypic antibody) present in a test sample that bind to BORIS. The 
antibody can be a polyclonal or a monoclonal antibody and can be identified using methods 
well known in the art. 

[0081] The inununobinding assays for use in the present invention include methods for 
detecting or quantifying the amount of BORIS in a test sample, which methods require the 
detection or quantitation of any immune complexes formed during the binding process. 
Here, a test sample suspected of containing a polypeptide molecule comprising an amino 
acid sequence encoding BORIS would be obtained from a mammal and subsequently 
contacted with an antibody. The detection or the quantification of the amount of immune 
complexes formed under the specific conditions is then performed. 
[0082) Contacting the test sample with an antibody that recognizes BORIS or an 
antibody that recognizes an antibody that is specific for BORIS under conditions effective 
and for a period of time sufficient to allow formation of immune complexes (primary 
immune complexes) is generally a matter of simply adding the antibody to the sample and 
incubating the mixture for a period of time long enough for the antibodies to form immune 
complexes with, i.e., to bind to, BORIS or an antibody that is specific for BORIS. After 
this time, the sample-antibody composition, such as a tissue section, ELISA plate, dot blot 
or Western blot, will generally be washed to remove any non-specifically bound antibody 
species, allowing only those antibodies specifically bound within the primary immune 
complexes to be detected. 

[0083] In general, the detection of immunocomplex formation is well-known in the art 
and can be achieved through the application of numerous approaches. These methods are 
generally based upon the detection of a label or marker, such as any radioactive, fluorescent, 
biological or enzymatic tags or labels of standard use in the art. U.S. Patents concerning the 
use of such labels include U.S. Patent Nos. 3,817,837, 3,850.752, 3,939,350, 3,996,345, 
4,277,437, 4,275,149 and 4,366,241. Of course, additional advantages can be realized by 
using a secondary binding ligand, such as a second antibody or a biotin/avidin ligand 
binding arrangement, as is known in the art. 

[0084] Alternatively, the first added component that becomes bound within the primary 
immune complexes can be detected by means of a second binding ligand that has binding 
affinity for the first antibody. In these cases, the second binding ligand is, itself, often an 
antibody, which can be termed a "secondary" antibody. The primary immune complexes 
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are contacted with the labeled, secondary binding ligand, or antibody, under conditions . 
effective and for a period of time sufficient to allow the formation of secondary immune 
complexes. The secondary immune complexes are then washed to remove any non- 
specifically bound labeled secondary antibodies or ligands, and the remaining label in the 
secondary immune complexes is then detected. 

[0085] Further methods include the detection of primary immune complexes by a two- 
step approach. A second binding ligand, such as an antibody, that has binding affinity for 
the first antibody is used to form secondary immune complexes, as described above. After 
washing, the secondary immune complexes are contacted with a third binding ligand or 
antibody that has binding affinity for the second antibody, again under conditions effective 
and for a period of time sufficient to allow the formation of immune complexes (tertiary 
immune complexes). The third ligand or antibody is linked to a detectable label, allowing 
detection of the tertiary inmiune complexes thus formed. 

[0086] It will be understood that other diagnostic tests can be used in conjunction with 
the diagnostic tests described herein to enhance further the accuracy of diagnosing a cancer 
or a predisposition to a cancer in a mammal. For example, a monoclonal antibody which is 
known to be specific for a cancer can be used in conjunction with the methods of the 
invention, or the detection of other genetic abnormalities known to.be associated with 
cancer or a predisposition to a cancer can be employed. ^ . . . , 

[0087] In addition to diagnosing a cancer or a predisposition to;a cancer, the present 
invention also provides a method of prognosticating a cancer in a mammal, wherein BORIS 
is a marker for the cancer, which method comprises measuring the level of BORIS in a test 
sample comprising somatic cells obtained from the mammal, wherein the level of BORIS in 
the test sample is indicative of the prognosis of the cancer in the mammal. The level of 
BORIS in the test sample can be measured by comparing the level of BORIS in another test 
sample obtained from the mammal over time in accordance with the methods described 
above. An increase in BORIS levels from one sample to the next is indicative of grov^h 
and/or metastasis of the cancer (i.e., a negative prognosis), whereas no change or a decrease 
in BORIS levels from one sample to the next is indicative of halted growth or even 
reduction of the cancer (i.e., a positive prognosis). 

[0088] The invention also provides a method of assessing the effectiveness of treatment 
of a cancer in a mammal, wherein BORIS is a marker for the cancer, which method 
comprises measuring the level of BORIS in a test sample comprising somatic cells obtained 
from the mammal, wherein the level of BORIS in the test sample is indicative of the 
effectiveness of the treatment of the cancer in the mammal. The level of BORIS in the test 
sample can be measured by comparing the level of BORIS in the test sample to the level of 
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BORIS in another test sample obtained from the mammal over time in accordance, with the 
methods described above. An increase in BORIS levels from one sample to the next is 
indicative of the treatment being ineffective, whereas no change or a decrease in BORIS 
levels from one sample to the next is indicative of the treatment being effective. 
[00891 As used herein, the terai "decreased level" can be defined as detecting BORIS in 
a test sample obtained from a mammal at a level below that which is considered normal. 
For example, the level of BORIS in a test sample is decreased when the copy nimiber of the 
gene encoding BORIS, the mRNA encoding BORIS, or a polypeptide molecule comprising 
an amino acid sequence encoding BORIS is detected at a level below that whichjs 
considered normal. Conversely, the term "increased level" can be defmed as detecting 
BORIS in a test sample obtained from a mammal at a level above that which is considered 
nomial. For example, the level of BORIS in a test sample is increased when the copy 
number of the gene encoding BORIS, the mRNA encodmg BORIS, or a polypeptide 
molecule comprising an amino acid sequence encoding BORIS is detected at a level above 
that which is considered normal. ''Normal levels" pertain to an already determined range of 
BORIS established from cancer-free mammals of the same species and are generally 
accepted and recognized in the art. 

[0090] The present invention ftuther provides a method of treating a- mammal* 
prophylactically or therapeutically for a cancer by administering to the mammjal ah inhibitor 
of BORIS. Typically, the cancer is due to the presence of (i) a nucleic acid molecule 
comprising a nucleotide sequence encoding BORIS or (ii) a polypeptide molecule 
comprising an amino acid sequence encoding BORIS. An inhibitor of (i) or (ii) can be 
administered to the mammal in an amount sufficient to treat prophylactically or 
therapeutically the mammal for the cancer. For example, if the cancer is due to the presence 
of (i), a corresponding inhibitor of (i) can be provided to the mammal by administering to 
the mammal an antisense or a ribozyme molecule specific for (i), wherein the antisense or 
ribozyme molecule inhibits (i) after being administered to the mammal. Alternatively, if the 
cancer is due to the presence of (ii), an inhibitor of (ii) can be provided to* the mammal by 
administering to the mammal a small molecule or an antibody specific for (ii), wherein the 
small molecule or antibody inhibits (ii) after being administered to the mammal. 
[0091] By "prophylactic" is meant the protection, in whole or in part, against a 
particular pathologic state. By "therapeutic" is meant the amelioration of a pathologic state, 
itself, and the protection, in whole or in part, against further infection. One of ordinary skill 
in the art will appreciate that any degree of protection from, or amelioration of, a pathologic 
state is beneficial to a mammal. 
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[0092] A male or female mammal can be diagnosed with» or predisposed to, any cancer 
utilizing the methods of the invention. Similarly, the methods involving prognosticating a 
mammal for a cancer, assessing the effectiveness of treatment of a cancer, and treating a 
mammal prophylactically or therapeutically for a cancer can be utilized with any cancer 
Preferably, the cancer is of epithelial origin and can include: lung cancer; renal cancer; anal 
cancer; bile duct cancer; bladder cancer; bone cancer; brain and spinal chord cancers; breast 
cancer; cervical cancer; lymphoma; colon and rectal cancer; endometrial cancer; esophageal 
cancer; gallbladder cancer; gastrointestinal cancer; laryngeal cancer; leukemia; liver cancer; 
multiple myeloma; neuroblastoma; ovarian cancer; pancreatic cancer; prostatic cancer; 
retinoblastoma; skin cancer (e.g., melanoma and non-melanoma); stomach cancer; testicular 
cancer; thymus cancer; thyroid cancer; as well as other carcinomas and sarcomas. 
[0093] In view of the above, the present invention also provides a composition 
comprising a carrier and either (i) an above-described isolated or purified nucleic acid 
molecule and corresponding fragments thereof, (ii) an above-described vector, (iii) an 
above-described polypeptide molecule and corresponding fragments thereof, or (iv) an 
above-described inhibitor of BORIS. The inhibitor of BORIS can be any compound and/or 
molecule or any other agent capable of inhibiting the normal function of BORIS. Typically, 
the inhibitor of BORIS is a small molecule, an antibody, an antisense molecule, or a 
ribozyme molecule. It is also conceivable to provide an inhibitor of BORISi which 
comprises a molecule (e.g., a zinc finger binding protein) that recognizes zinc finger 
binding domains specific for BORIS and can therefore initiate its inhibition. It will be 
understood that when such zinc finger binding proteins are used, these molecules will be 
employed to specifically recognize zinc finger binding domains of BORIS as compared to 
other proteins comprising similar zinc finger binding domains (e.g., CTCF), such that the 
normal function of these similar proteins is not inhibited. Methods of identifying these 
inhibitors are well known in the art and can be accomplished without any undue 
experimentation using a variety of in vitro assays. 

[0094] The composition can comprise more than one active ingredient, such as 
comprising more than one inhibitor of BORIS. Alternatively, or additionally, the 
composition can comprise another pharmaceutically active agent or drug. For example, 
when treating cancer, other anticancer compounds can be used in conjunction with the 
composition of the present invention and include, but are not limited to, all of the known 
anticancer compounds approved for marketing in the United States and'those that will 
become approved in the future. See, for example. Table 1 and Table 2 of Boyd, Current 
Therapy in Oncology^ Section 1. Introduction to Cancer Therapy (J,E. Niederhuber, ed.). 
Chapter 2. by B.C. Decker, Inc., Philadelphia, 1993. pp. 1 1-22. More particularly, these 
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other anticancer compounds include doxorubicin, bleomycin, vincristine, vinblastine, VP- 
16, VW-26, cisplatin, carboplatin, procarbazine, and taxol for solid tumors in general; 
alkylating agents, such as BCNU, CCNU, methyl-CCNU and DTIC, for brain or kidney 
cancers; and antimetabolites, such as 5-FU and methotrexate, for colon cancer. 
[0095] The carrier can be any suitable carrier. Preferably, the carrier is a 
pharmaceutically acceptable carrier. With respect to compositions, the carrier can be any of 
those conventionally used and is limited only by chemico-physical considerations, such as 
solubility and lack of reactivity with BORIS, and by the route of administration. It will be 
appreciated by one of skill in the art that, in addition to the above-described composition, 
the compositions of the present inventive methods can be formulated as inclusion 
complexes, such as cyclodextrin inclusion complexes, or liposomes. 
[0096] The pharmaceutically acceptable carriers described herein, for example, 
vehicles, adjuvants, excipients, and diluents, are well-known to those skilled in the art and 
are readily available to the public. It is preferred that the pharmaceutically acceptable 
carrier be one which is chemically inert to the BORIS and one which has no detrimental 
side effects or toxicity under the conditions of use. 

[0097] The choice of carrier will be determined in part by the particular BORIS or 
inhibitor of BORIS involved, as well as by the particular method used to administer the 
composition. Accordingly, there are a variety of suitable formulations of the composition of 
the present invention. The following formulations for oral, aerosol, parenteral, 
subcutaneous, intravenous, intramuscular, interperitoneal, rectal, and vaginal administration 
are exemplary and are in no way limiting. 

[0098] One skilled in the art will s^preciate that suitable methods of administering a 
composition of the invention to a mammal, in particular a human, are available, and, 
although more than one route can be used to administer a particular compound, arparticular 
route can provide a more immediate and more effective reaction than another route. 
Accordingly, the herein-described methods are exemplary and are in no way limiting. 
[0099] The dose administered to a mammal, in particular a himian, should be sufficient 
to treat prophylactically or therapeutically the cancer in the mammal. One skilled in the art 
will recognize that dosage will depend upon a variety of factors including the strength of the 
particular composition employed, as well as the age. species, condition, and body weight of 
the mammal. The size of the dose will also be determined by the route, timing, and 
frequency of administration as well as the existence, nature, and extent of any adverse 
side-effects that might accompany the administration of a particular composition and the 
desired physiological effect. 
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[00100] Suitable doses and dosage regimens can be determined by conventional 
range- finding techniques known to those of ordinary skill in the art. Generally, a 
composition is initially administered in smaller dosages, which are less than the optimum 
dose of the composition. Thereafter, the dosage is increased by small increments until the 
optimum effect under the circumstances is reached. The present inventive method will 
typically involve the administration of about 0.1-100 mg of one or more of the compositions 
described above per kg body weight. 



The following examples serve to illustrate the present invention and are not intended 
to limit its scope in any way. 

Example 1 

[001 01 J This example demonstrates the isolation, identification and characterization of 
the human and murine BORIS cDNA sequences. 

[00102] Comparative electrophoretic mobility shift assays (EMS As) of nuclear extracts 
(NEs) were prepared from total rat testis and liver. Several well-characterized CTCF-target 
sequences were utilized as EMSA probes (see, e.g., Ohlsson et al.. Trends Genet, 77:520- 
527 (2001)). Specifically, NEs fi-om rat or mouse testis and liver tissues were prepared 
essentially according to the protocol of Lichtsteiner et al. (see, Lichtsteiner et al.. Cell, 51: 
963-973 (1987)), but with addition of protease and phosphatase inhibitors (see, e.g., 
Klenova et al., A/b/ Cell Biol, 13: 7612-7624 (1993) and Lobanenkov et al.. Oncogene. 5: 
1743-1753 (1990)). The same mhibitors were present in all other protein-containing 
solutions unless otherwise indicated. NEs bom cultured cell lines were obtained with a 
NUN-buffer containing 0.3M NaCl, IM urea, and 1% nonionic detergent Nonidet P-40 (see, 
e.g., Klenova et al., J Biol Chem, 275:26571-26579 (1998) and Filippova et al., Mol Cell 
Biol, 75:2802-2813 (1996)). The length and sequence of each DNA fragment used as a 
probe for EMSA, and labeling and purification of the probes, were essentially as detailed in 
Kanduri et al, CurrBiol, 70:853-856 (2000) and in Filippova et al.. Cancer Research, 62: 
available online (2002). Binding reactions for EMSA were carried out in a buffer 
containing phosphate buffered saline (PBS) with 5mM MgCl2, O.lmM ZnS04, ImM DTT, 
0.1% Nonidet P-40 and 10% glycerol in the presence of poly(dl-dC), double-stranded 
poly(dG)-poly(dC), and a 44-mer ohgonucleotide 

5'-CTAGAGCCCCTCGGCCGCCCCCTCGCGGCGCGCCCTCCCCGCTT-3' (SEQ ID 
NO:5). Such an oligonucleotide harbors overlapping binding sites for Spl, Egrl (Zif268) 
and "poly-G"-binding nuclear factors which can bind to the relatively short GC-rich 
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segments within the extended CTCF sites. Reaction mixhxres of 20 fil were incubated for 
30 minutes at room temperature and then analyzed on 5% nondenaturing PAGE run in 0.5x 
tris-borate-EDTA buffer. For super-shifting EMS A experiments, antibodies in PBS were 
added to the protein-DNA binding reactions. Results obtained from testis >fEs revealed a 
DNA-protein complex with electrophoretic mobility slightly slower than that for the CTCF 
complex. This activity was not detected in NEs prepared from rat liver cells, or from a 
variety of other somatic tissues from rats and mice. 

[00103] The binding activity found only in testis NEs exhibited DNA-binding properties 
like those of CTCF. Indeed, binding activity in testis NEs could be observed only with 
DNA probes bearing known CTCF-target sequences, such as the FII insulator site from the 
chicken globin locus (Bell et al., Cell 95:387-396 (1999)). This activity could be competed 
with an excess of unlabeled DNA fragments bearing other CTCF targets, but not with the 
same fragments mutagenized at specific CTCF-contacting bases (see e.g., Kanduri et al., 
supra, Filippova et al., Nat Genet, 25:335-343 (2001), Klenova et al. (1993), supra, 
Filippova et al. (1996), supra, and Awad et 2\,,JBiol Chem, 274:27092-27098 (1999)), or 
with the same molar excess of additional control DNA fragments of X-phage DNA digested 
with Hindm. 

[00104] It was also found that, like CTCF, the testis-specific factor could be 
"supershifted" in EMSAs with an excess of affinity-purified antibodies against the 
bacterially-expressed, His-tagged, C-terminal part of human CTCF, the region beginning 
from the middle of the 1 1'*^ zinc finger region (ZF) and ending at the stop codon. However, 
in contrast to DNA-bound CTCF, this testis-specific DNA-binding activity could not be 
supershifted by affinity-purified antibodies against the conserved N-terminal region of 
CTCF upstream of the first ZF. Taken together, these results suggest that in addition to 
CTCF, nuclear extracts from testis contained a different form of CTCF or a protein highly 
related to CTCF. 

[00105] To identify the human testis-specific CTCF-like protein(s), a variety of 
oligonucleotides homologous to regions of sequence identity found in the frog, chicken, 
mouse, rat, and human CTCF cDNAs were designed by the Pile-up and Pretty plot 
algorithms of the Wisconsin GCG package. Specifically, fix>g, chicken, mouse, rat, and 
human CTCF cDNA sequences, as well as Drosophila CTCF cDNA (GenBank accession # 
AF3 13621; J. Moore, G. Filippova, and V.V.L., unpublished results) were all included in a 
search for exceptionally conserved short DNA segments for use in designing the PCR- 
screening primers listed in Figm-e 4A. These were used in numerous combinations in 
attempts to PCR-amplify human testis-specific CTCF-like cDNA fingments. As a template, 
the "MARATHON-Ready* human testis cDNA (Clontech, Palo Alto, CA; cat# 7415-1) was 
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used. Annealing temperatures were adjusted according to the lowest Tm of the primer in 
each pair minus 6 °C. Each combination of primer pairs was utilized to work with a 
MasterAmp PGR Optimisation Kit (Epicentre Technologies, cat#MO7201). PGR products 
were analyzed on agarose gels. Distinct DNA-bands were purified and cloned into pGR 
2.1-TOPO vector (Invitrogen) and subsequently sequenced. Over a hundred of resulting 
fragments were cloned into the vector and sequenced. One such fragment displayed a novel 
human cDNA sequence containing an ORF encoding CTCF-like ZFs. This sequence served 
to design the new pairs of primers, NEW/TC/for and NE W/TC/rev (Fig. 5 A), for a stringent 
PGR analyses of the "Rapid Screen Arrayed human testis cDNA Library Panel'* (Origene), 
as well as for the 5'- and 3 '-RAGE with the MARATHON-Ready testis cDNA and adaptor 
primers from the Marathon cDNA Amplification kit (Clontech, Palo Alto, GA). This 
resulted in isolation of a near-full length BORIS cDNA insert in the pCMV6 vector, and of 
the cDNA sequence shown in Fig. 1 A. 5* RACE was peiibmied using GeneRacer kit 
(Invitrogen cat# 45-0079) according to the manufacturers instructions. A similar strategy, 
but based on the finding of near-identical nucleotide sequences in human BORIS and in the 
murine CTGF cDNAs was used to design pairs of primers, listed in Figure 4B, for a PGR- 
mediated screening for the mouse homologue in the MARATHON-ready mouse testes 
cDNA library (Glontech, Palo Alto, CA). Again, after obtaining and sequencing a'fragment 
encoding the mouse BORIS ZF region, new internal specific primers combined with those 
from the Marathon cDNA Amplification kit were utilized to subclone and sequence the 5' 
and 3' termini of the mouse cDNA. This resulted in the murine BORIS cDNA sequence 
shown in Fig. IB that extends to the polyA end, but truncates at the 5*-UTR. Specific 
methods for 5 '-RACE over "difficult" GC-rich region will be used to complete sequence of 
the5'-UTR. 

Example 2 

[00106] This example further demonstrates that BORIS expression is testis-specific. 
[00107] Human and mouse tissues were analyzed for expression of BORIS mRNA by 
hybridization of Northern blots and by RT-PCR with cDNAs prepared commercially. To 
probe Northern blots, the Ndel - AccI fragment of the 5* end of human CTCF cDNA clone 
p7. 1 and Xhol - Xhol fragment of the BORIS cDNA were used. For analyses of the normal 
expression patterns, human BORIS-specific primers (Forward, 5V 
caggccctacaagtgtaacgactgcaa-3' (SEQ ID NO:46) and Reverse, 5*- 
gcattcgtaaggcttctcacctgagtg-3' (SEQ ID NO:47)) were used to amplify human BORIS by 
PGR. Similarly, mouse BORIS-specific primers (Forward, 
5*-gagagacagacaagagagaagagaggttgctc-3* (SEQ ED NO:48) and Reverse, 
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5 -cctgtgtgggtgttcacatggttcctaagaag-3' SEQ ID NO:49)) were used to amplify mouse BORIS 
by PCR. Amplification of human and mouse p-actin with primers provided by OriGene 
was performed in parallel as a control to normalize for gel-loading differences. 
[00108] In sum, these studies demonstrate that expression of BORIS transcripts is strictly 
testis-specific in both mouse and human. It is worth noting that, even with the high 
sensitivity of the RT-PCR method, BORIS expression was below the limits of detection in 
mouse or human ovaries, and in tissues of 8.5-day to 19-day mouse embryos. 

Example 3 

(00109) This example demonstrates that BORIS maps to a position located on human 
chromosome 20. 

[00110] For BORIS chromosome mapping, metaphase spreads were prepared from the 
peripheral blood leukocytes of a normal male donor according to standard procediu-es. The 
entire PAC clone RP4-579F20 (AL160176) containing most of the coding exons (or the 
human BORIS cDNA) was labeled with digoxygenin-l 1-dUTP and used as a probe for 
FISH using the procedure previously described in detail (see, e.g., Pack et al.. Cancer Res.y 
59:5560-5564 (1999)). For cell typing, frozen mouse testis sections were used as a 
template. A mixture of the Coatosome X labeled with Spectrum Orange and Coalosome Y 
labeled with Spectmm Green (Vysis, Downers Grove, IL) was used as a probe. The DNA 
was denatured at 78 °C for 5 minutes and hybridized overnight in a himiidified chamber at 
37 followed by washes at 45 °C in 50% formamide/2xSSC (5-min x3), and O.lxSSC 
(5min x2), 4xSSC/0.1% Tween 20 at RT (2min). Detection of cDNA probe was done 
using anti-Digoxigenin Rhodamine (Roche) or with avidin-FITC (if labeled with biotin -16- 
dUTP). Slides were counterstained with 0.25 mg/ml DAPI-antifade (4*,6-Diamidino-2- 
phenylindole dihydrochloride). 

(001 1 1 1 As indicated above, human BORIS maps to position 20ql 3 on human 
chromosome 20, a region paralogous to CTCF-containing locus at 16q22 and orthologous to 
the H3-H4 region of mouse chromosome 2. Taken together with the results of genomic 
structure analyses, these findings provide evidence that BORIS maps to position 20ql3.2 
and is a CTCF paralogue. 



Example 4 

(001 12] This example demonstrates the evolutionary origin of human BORIS, and, in 
particular, further demonstrates its relationship with CTCF and murine BORIS. 
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[001 13] An optimal alignment of human BORIS and CTCF amino acid sequences (Fig. , 
3B) revealed a remarkable identity of the entire 1 1 2Ts, including all of the major 
DNA-base-recognition residues at positions -1, 2, 3, and 6 within each finger. ZF regions 
are illustrated in Fig. 3A for human BORIS and Fig. 3D for murine BORIS. 
[001 14] To verify that the cloned BORIS cDNA encodes the same CTCF-site-binding 
activity that was initially detected in testis NEs by EMSAs, the clone was used to produce a 
full-length recombinant BORIS in Pichia pastoris yeast as described earlier for CTCF (see, 
e.g., Quitschke et al.. Nucleic Acids Res, 25:3370-3378 (2000)). CTCF was purified as 
originally described by Quitschke et. al. (2000), supra, with modifications outlined recently . 
by Vostrov et. al., J Biol Chem, 8:ms Ml 09748200 in JBC website (2001). Expression of . 
BORIS in yeast was accomplished using the Pichia Expression Kit (Invitrogen Co., 
Carlsbad, CA) according to the manufacturer's instructions, with chromatography steps 
similar to those described for CTCF (see, e.g., Vostrov et al. (2001), supra). Briefly, 
BORIS cDNA EcoRI - NotI DNA fragment from the pCVM6/BORIS cDNA was re-cloned 
into the polycloning site of the pPIC3.5 Pichia vector that directs intracellular recombinant 
protein expression in Pichia pastoris. The vectors containing BORIS cDNA were 
transformed into Pichia strand KM71 by electroporation. After growth to preparative 
quantities (10-15g), Pichia cells were homogenized with a Bead Beater apparatus (Biqspec- . : 
Products, Inc., Bartensville, OK) in buffer containing 40 mM HEPES, pH 7.6. 2niM 
MgS04, ImM EDTA, 10 yM ZnS04, lOOmM KCl. Cell debris was pelleted at 5,000 g for 
10 minutes and the supernatant was further clarified by centrifugation at 100,000 g for 30 
minutes. For use as a non-specific control, wild-type Pichia yeast protein extract also was 
prepared and used as a template for coupled in vitro transcription/translation in reticulocyte 
lysate TnT (Promega). Positive clones were amplified, induced for protein expression and 
screened for the presence of BORIS by Westem blotting. The resulting fiill-length-BORIS 
proteins were analyzed in EMSAs side-by-side with testis and liver NEs. 
[001 15] The results of the EMSAs and NEs demonstrate that recombinant BORIS forms 
a complex with the EH DNA generating the same-mobility EMSA-band as that produced by 
the endogenous BORIS from testis NEs. Conversely, recombinant fulHengdi-CTCF 
produced the faster-migrating band that also is present in NEs from a variety of tissues. 
Similar results were obtained with the proteins produced in TnT-lysates and in Pichia. 
[001 16] The relationship between human and murine BORIS was also analyzed. The 
Bestfit alignment of mouse and human BORIS amino acid sequences (Fig. 3C) 
demonstrated that, while all 1 1 ZFs are practically identical, the regions outside the ZFs are 
only similar. The latter sequences are not as highly conserved as the regions of CTCF that 
flank the ZFs, While outside ZFs, CTCF proteins have > 90% identical amino acids in all 
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vertebrates from frogs to humans, these regions of mouse and human BORIS and CTCF 
proteins manifest no obvious homology to one another and no significant similarities to 
other proteins analyzed by the SMART search engine (http://smart,embl-heidelberg.de). 
(001 17] Thus, the remarkable similarity of "shared" ZFs and absence of significant 
similarities outside the ZFs suggest that, while mammalian CTCF and BORIS recognize the 
same spectrum of DNA sequences, the functional consequences of DNA binding by these 
two proteins is likely to be different. 

Example 5 

[00118] This example demonstrates that BORIS is a novel "cancer-testis" gene 
abnomially activated in multiple mahgnancies and maps to a region frequently amplified in 
a variety of cancers. 

[00119] The human chromosome 20qI3.2 region that encompasses the BORIS gene is 
commonly amplified, or exhibits moderately gains of material, in many human cancers. 
This has led to the suggestion that this region contains a major oncogene or a dominant 
immortalizing gene(s) that can overcome senescence and promote genome instability. The 
localization of human BORIS to this cancer-related chromosomal locus, as well as frequent 
loss of gene imprinting in cancer involving abnormal methylation of CTCF target sites and 
possibly other mechanisms, raised the possibility that aberrant activation of BORIS 
expression in tissues other then testis could be associated with tumor pathogenesis. 
[00120] Northern blot or RT-PCR analyses in a variety of cancer cell lines representing 
most of the major forms of human tumors were tested for the presence of BORIS transcripts 
(see Table 1 below). CTCF-specific primers served as an internal control of the quality of 
both RNA and first-strand cDNA. 



wo 03/072799 PCTA;S03/05186 

32 



Table 1 


Boris Positive 


Boris Negative 


Cancer Type 


Cell Line 


Breast Cancer 


ZR75-1, MDA453, MDA231, 
MCF7/ADR-RES. HS578T, 
CAMA.l, MDA435, DU4475 


MCF7, MDA-MB-23 1 , MDA- 
MB-435. MDA-N. T47D. 
BT549 


Colon Cancer 


C0LO320-HSR, DLDl, 
COLO205, HCC-2998 


HCTl 16, SW48, HCT15, HT29, 
KM12, SW.620 


Bladder Cancer 


5637, T24, J82 


None Identified 


Erythroleukemia 


K562, TF-l 


None Identified 


Glioblastoma 


mi, U3T3, SF-539, SNB-19, 
i9oCj, or -Zoo 


SF-295, SNB-75 


Lymphoma 


BCBLl. Hutl02-TH, H9. Peer-I, 

SU.DHL4, SU-DHL5, SU- 
DHL6, SU.DHL7, SU-DHLIO, 
Kaji, ueigaao, ou, iN%_^co, lera- 
l.Daudi 


L-428. Dev, KM-H2, HSB, SR, 
MOLT4, CEM, Jurkat, RPMI- 

8402,SU-DHLl,KiJK, 
Karpas299, SR786, Wynn, 

TT^lft Wilcnn PW-'X/^ Thnmac, 
jijjOi wuson cw*jo, inomaa- 

0, RPMI-8392, Granta, CCRF- 

CEM 


Non-SmalUCell-Lung Cancer 


A549, EKVX, NCI-H23. NCI- 
H522 


NCI-H2087, NCI-H2228, NCI- 
rl40U, INv^l-rl'fr*-60, iNk-'l- 
H322M, 'h6p-92. HOP-62 


Melanoma 


G361, 624.28-MEL, 624.38- 
MEL, 938-MEL, 1359-MEL, 
83o-M£L, Aj75, OOQOWnS- 
MEL, LOX IMVI, MALME- 
3M, SK-MEL-2, SK-MEL.28, 
SK-MEL-S, UACC-257, UACC- 
62 


n23-MEL.M14 


Myeloma 


KMMl, KMSl, KMS5, KMS18, 
NCI-H929 


KMS12-BM, KMS-11, KMS20, 
RPMI8226, HAAl 


Neuroblastoma 


SK-N-D2, GoTo, SK-N-SH, 
UU8. H4C, SK>N-AS, SK-N- 
DZ 


CLB-Ma. NBL.W, SHSY5Y 


Ovarian Cancer 


IGROVl, OVCAR-3, OVCAR- 
4, OVCAR-8 


OVCAR-5, SK-OV-3 


Prostate Cancer 


Vcap. DuCap, TP2 


LNCap, PC-3. LNCap clone 
FGC, DU145 


Renal Cancer 


TK-10 


RCC, 786-0, A498, ACHN, 
CAKI-l, SN12C, UO-3U RXF- 
393 


Rabdomyosarcoma 


RH30, RDG2 


RH18.RDG7 


Miscellaneous Cancers 


NCCIT, HcLa, U-2-OS, 
QMHKIO, RD-ES, SK-NEP-1, 
JEG-36, PFSK-l 


Katom, SW1088, HT-S, 
SW872, Hep3BH. U937, HL-60 
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[00121] In total, approximately two hundred cell lines were tested. BORIS transcripts 
were detected in more then half (106/193) of the cell lines, although the proportions of 
positive lines varied widely among tumor types. Significant proportions of lymphoma, 
15/37 (40%); breast cancer, 8/14 (57%); melanoma cell lines, 15/17 (88%); and of Wilms 
tumors, 5/9 (56%) (one cell line, and 8 primary tumor samples) expressed BORIS. 
Moreover, preliminary results of the BORIS expression analyses in over fifty randomly 
selected primary breast cancer samples demonstrated that the fi-equency of BORIS abnormal 
expression in these samples is -50-60%. Thus, the fi*equency of abnormal BORIS 
activation in primary breast cancer is similar to that observed in breast cancer cell lines. 
[00122] These results indicate that the normally strict silencing of BORIS in somatic 
tissues is firequently abrogated in many different cancer cell lines. Thus, it is evident that 
BORIS is a novel cancer-testis gene abnormally activated in multiple malignancies. 

Example 6 

[00123] This example demonstrates that BORIS and CTCF compete for similar DNA 
binding regions and that such competition promotes abnormal cell growth. 
[00124] EMSA analyses with DNA probes representing CTCF-binding sequences in the 
H19 ICR and FII insulator site were performed. The fiill-length and DNA-binding 1 1 ZF 
domain versions of BORIS and CTCF were mixed in various proportions. Since the 
isolated 1 1 ZF domains of CTCF and BORIS have in vitro DNA-binding properties similar 
to those of the full-length proteins, each protein was represented by either a full-length 
polypeptide or by its 1 1 ZF domain to facilitate identification of the corresponding bands on 
EMSA gels. The addition of increasing amounts of the BORIS 1 1 ZF protein to EMSA 
reactions with constant amounts of the fiill-length CTCF and DNA from the H19 ICR 
resulted in efficient competition of CTCF/DNA complexes by BORIS. In a reverse 
experiment with a DNA probe containing the FII insulator site, the CTCF 1 1 ZF domain 
efficiently competed for formation of the BORIS/DNA complex. These results provide 
evidence that the in vivo occupancy of a common target for CTCF and BORIS will be 
determined by the relative levels of DNA-binding forms of these proteins in the sub-nuclear 
compartments where CTCF, BORIS, and a target DNA co-localize. 
[00125] To test if competition with CTCF by exogenous expression of BORIS would 
promote growth or transform NIH3T3 cells, which normally express BORIS at levels below 
the limits of detection by RT-PCR, the pCDN-BORIS expression vector was engineered as 
described earlier for CTCF (see, e.g., Rasko et al.. Cancer Res, 67:6002-6007 (2001)). This 
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construct utilizes a CMV promoter for coupled expression of both BORIS and Neo coding « 
regions connected by an internal ribosome entry site (IRES) within one bi-cistronic 
message. In an attempt to establish stable BORIS-expressing cell lines, cells were 
transfected with 0. 1 ^g tolO ^g of either pCDN-BORIS plasmid or a control vector 
expressing Neo but no BORIS. Less then 20 hours after transfection, marked cell death was 
observed in cells transfected with the BORIS-containing constructs. The numbers of 
residual viable cells were inversely proportional to the vector inputs. In contrast, practically 
no cell death was evident in cells transfected with the control vector. When RT-PCR was 
used to analyze total RNA prepared from cells collected one day after transfection^ both 
BORIS and Neo parts of the bi-cistronic message were detected. In additional studies, the 
dead cells from cultures transfected with pCHN-BORIS were removed, and the remaining 
viable cells were cultured in the presence of G41 8 for 14 days. Surprisingly, none of the 
few G4l8-resistant colonies recovered expressed BORIS sequence detectable by RT-PCR, 
but all were positive for the Neo sequence. The toxic effects of constitutive BORIS 
expression from a heterologous promoter were likely due to accumulation of BORIS at 
levels sufficient to compete with most CTCF-DNA interactions in vivo in a manner similar 
to that observed when mixing DNA-targets with CTCF and BORIS in vitro. This would 
cause a complete block of CTCF fimctions since the observed effect of BORIS over- 
expression, namely cell death, is similar to the effects caused by CTCF depletion. 
Therefore, only partial interference with CTCF functions may be permissive for cell 
immortalization and/or transformation, rather then cell death. This hypothesis is supported 
by the recent results of mutational analyses of CTCF in timiors selected for LOH at the 
locus of the human CTCF (16q22). Several ZF-specific missense point-mutations that 
resulted in selective alterations in target site specificities of CTCF binding were found but 
none of these tumors contained truncating CTCF mutations that could cause complete loss 
of CTCF functions. 

[00126] These results indicate that BORIS and CTCF, when present together, compete 
for the same DNA binding sites and that such competition can lead to abnormal cell growth. 

[00127] All of the references cited herein, including patents, patent applications, and 
publications, are hereby incorporated in their entireties by reference. 
[00128] While this invention has been described with an emphasis upon preferred 
embodiments, variations of the preferred embodiments can be used, and it is intended that 
the invention can be practiced otherwise than as specifically described herein. Accordingly, 
this invention includes all modifications encompassed within the spirit and scope of the 
invention as defined by the claims. 
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WHAT IS CLAIMED IS: 

1 . An isolated or purified nucleic acid molecule consisting essentially of a 
nucleotide sequence encoding human brother of regulator of imprinted sites (BORIS) or a 
fragment thereof comprising at least 1536 contiguous nucleotides. 

2. The isolated or purified nucleic acid molecule of claim 1, which (i) encodes 
the amino acid sequence of SEQ ID NO: 2 or a fragment thereof comprising at least 307 
contiguous amino acids, (ii) consists essentially of the nucleotide sequence of SEQ ID NO: 
1 or a fragment thereof comprising at least 1536 contiguous nucleotides, (iii) hybridizes 
under highly stringent conditions to an isolated or purified nucleic acid molecule consisting 
essentially of the nucleotide sequence that is complementary to SEQ ID NO: 1 or a 
fragment thereof, or (iv) shares 45% or more identity with SEQ ID NO: 1. 

3. An isolated or purified nucleic acid molecule consisting essentially of a 
nucleotide sequence that is complementary to either of a nucleotide sequence encoding 
human BORIS or a fragment thereof comprising at least 1536 contiguous nucleotides. 

4. The isolated of purified nucleic acid molecule of claim 3, which (i) is 
complementary to a nucleotide sequence encoding the amino acid sequence of SEQ ID 
NO:2, (ii) is complementary to the nucleotide sequence of SEQ ID NO:l or a fragment 
thereof comprising at least 1536 nucleotides, (iii) hybridizes under highly stringent 
conditions to an isolated of purified nucleic acid molecule consisting essentially of SEQ ID 
NO: 1 or a fragment thereof, or (iv) shares 45% or more identity with the nucleotide 
sequence that is complementary to SEQ ID NO: L 

5. An isolated or purified nucleic acid molecule consisting essentially of a 
nucleotide sequence encoding a non-human BORIS or a fragment thereof comprising at 
least 229 contiguous nucleotides. 

6. The isolated or purified nucleic acid molecule of claim 5, which (i) encodes 
the amino acid sequence of SEQ ID NO: 4 or a fragment thereof comprising at least 21 
contiguous amino acids, (ii) consists essentially of the nucleotide sequence of SEQ ID NO: 
3 or a fragment thereof comprising at least 229 contiguous nucleotides, (iii) hybridizes 
under moderately stringent conditions to an isolated or purified nucleic acid molecule 
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consisting essentially of the nucleotide sequence that is complementary to SEQ.ID NO: 3 or 
a fragment thereof, or (iv) shares 23% or more identity with SEQ ID NO: 3. 



nucleotide sequence that is complementary to either of a nucleotide sequence encoding a 
non-human BORIS or a fragment thereof comprising at least 229 contiguous nucleotides. 

8. The isolated of purified nucleic acid molecule of claim 7, which (i) is 
complementary to a nucleotide sequence encoding the amino acid sequence of SEQ ED 
N0:4, (ii) is complementary to the nucleotide sequence of SEQ ED NO:3 or a fragment 
thereof comprising at least 229 nucleotides, (iii) hybridizes under moderately stringent 
conditions to an isolated of purified nucleic acid molecule consisting essentially of SEQ ID 
NO:3 or a fragment thereof, or (iv) shares 23% or more identity with the nucleotide 
sequence that is complementary to SEQ ED N0:3- 

9. A vector comprising the isolated or purified nucleic acid molecule of claim 

1, 

10. A vector comprising the isolated or purified nucleic acid molecule of claim 

3. 

11. A vector comprising the isolated or purified nucleic acid molecule of claim 

5. 

1 2. A vector comprising the isolated or purified nucleic acid molecule of claim 

7. 

13. A cell comprising the vector of claim 9. 

14. A cell comprising the vector of claim 10. 

15. A cell comprising the vector of claim 1 1 . 



7. 



An isolated or purified nucleic acid molecule consisting essentially of a 



16. A cell comprising the vector of claim 12. 
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17. An isolated or purified polypeptide molecule consisting essentially of an 
amino acid sequence encoding human BORIS or a fraigment thereof comprising at least 307 
contiguous amino acids, either one of which is optionally glycosylated, amidated, 
carboxylated, phosphorylated, esterified, N-acylated or converted into an acid addition salt 
and/or optionally dimerized or polymerized. 

1 8. The isolated or purified polypeptide molecule of claim 17, which (i) is 
encoded by the nucleotide sequence of SEQ ID NO: 1 or a fragment thereof comprising at 
least 921 contiguous nucleotides, (ii) consists essentially of the amino acid sequence of SEQ 
ID NO: 2 or a fragment thereof comprising at least 307 contiguous amino acids or (iii) 
shares 47% or more identity with SEQ ED NO: 2. 

19. An isolated or purified polypeptide molecule consisting essentially of an 
amino acid sequence encoding a non-himian BORIS or a fragment thereof comprising at 
least 21 contiguous amino acids, either one of which is optionally glycosylated, amidated, 
carboxylated, phosphorylated, esterified, N-acylated or converted into an acid addition salt 
and/or optionally dimerized or polymerized. 

20. The isolated or purified polypeptide molecule of claim 19, which (i) is 
encoded by the nucleotide sequence of SEQ ID NO:3 or a fragment thereof comprising at 
least 63 contiguous nucleotides, (ii) consists essentially of the amino acid sequence of SEQ 
ID NO: 4 or a fragment thereof comprising at least 21 contiguous amino acids or (iii) shares 
40% or more identity with SEQ ID NO: 4. 

21. A cell line that produces a monoclonal antibody that is specific for a region 
of the isolated or purified polypeptide molecule of claim 17, wherein the region comprises 
any region that is recognizable by the monoclonal antibody other than one spanning a zinc 
finger region. 

22. The monoclonal antibody produced by the cell line of claim 2 1 . 

23. A method of diagnosing a cancer or a predisposition to a cancer in a male 
mammal, which method comprises detecting either (i) a nucleic acid molecule comprising a 
nucleotide sequence encoding BORIS or (ii) a polypeptide molecule comprising an amino 
acid sequence encoding BORIS in a test sample comprising somatic cells obtained from the 
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male mammal, wherein the detection of (i) or (ii) in the test sample is indicative of the 
cancer or a predisposition to the cancer in the male mammal. 

24. The method of claim 23, wherein the nucleic acid molecule comprising the 
nucleotide sequence encoding BORIS comprises SEQ ID NO: I . 

25. The method of claim 23, wherein the polypeptide molecule comprising an 
amino acid sequence encoding BORIS comprises SEQ ID NO:2. 

26. A method of predicting a predisposition to a cancer in an offspring of a male 
mammal, which method comprises detecting either (i) a mutation in a nucleic acid molecule 
comprising a nucleotide sequence encoding BORIS, (ii) a decreased level of a polypeptide 
molecule comprising an amino acid sequence encoding wild-type BORIS, or (iii) a mutation, 
in a polypeptide molecule comprising an amino acid sequence encoding BORIS in a test 
sample comprising germ cells obtained from the male mammal, wherein the detection of (i), 
(ii), or (iii) in the test sample is indicative of the cancer or a predisposition to the cancer in 
the offspring of the male mammal. 

27. The method of claim 26, wherein the nucleic acid molecule comprising the 
nucleotide sequence encoding BORIS comprises SEQ ED NO: 1. 

28. The method of any of claims 26, wherein the polypeptide molecule 
comprising an amino acid sequence encoding BORIS comprises SEQ ED N0:2. 

29. A method of diagnosing a cancer or a predisposition to a cancer in a female 
mammal, which method comprises detecting either (i) a nucleic acid molecule comprising a 
nucleotide sequence encoding BORIS or (ii) a polypeptide molecule comprising an amino 
acid sequence encoding BORIS in a test sample obtained from the female mammal, wherein 
the detection of (i) or (ii) in the test sample is indicative of the cancer or a predisposition to 
the cancer in the female mammal. 

30. The method of claim 29, wherein the nucleic acid molecule comprising the 
nucleotide sequence encoding BORIS comprises SEQ ID NO: 1 . 

3 1 . The method of any of claims 29, wherein the polypeptide molecule 
comprising an amino acid sequence encoding BORIS comprises SEQ ID NO:2. 
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32. A method of prognosticating a cancer in a mammaU wherein BORIS is a 
marker for the cancer, which method comprises measuring the level of BORIS in a test 
sample comprising somatic cells obtained from the mammal, wherein the level of BORIS in 
the test sample is indicative of the prognosis of the cancer in the mammal, and wherein the 
level of BORIS in the test sample is measured by comparing the level of BORIS in the test 
sample to the level of BORIS in another test sample obtained from the mammal over time, 
wherein a decrease or no change in the level of BORIS over time is indicative of a positive 
prognosis, and an increase in the level of BORIS over time is indicative of a negative 



33. A method of assessing the effectiveness of treatment of a cancer in a 
mammal, wherein BORIS is a marker for the cancer, which method comprises measuring 
the level of BORIS in a test sample comprising somatic cells obtained from the mammal, 
wherein the level of BORIS in the test sample is indicative of the effectiveness of treatment 
of the cancer in the mammal, and wherein the level of BORIS in the. test sample is measured 
by comparing the level of BORIS in the test sample to the level of BORIS in another test 
sample obtained from the same mammal over time, wherein a decrease or no change in the 
level of BORIS over time is indicative of the treatment being effective, and an increase in 
the level of BORIS over time is indicative of the treatment being ineffective. 

34. A method of treating a mammal prophylactically or therapeutically for 
cancer, wherein the cancer is due to the presence of (i) a nucleic acid molecule comprising a 
nucleotide sequence encoding BORIS or (ii) a polypeptide molecule comprising an amino 
acid sequence encoding BORIS, which method comprises providing an inhibitor of (i) or 
(ii) to the mammal in an amount sufficient to treat prophylactically or therapeutically the 
mammal for the cancer. 

35. The method of claim 34, wherein the cancer is due to the presence of (i) and 
wherein an inhibitor of (i) is provided to the manunal by administering to the manunal an 
antisense or a ribozyme molecule specific for (i), wherein the antisense or ribozyme 
molecule inhibits (i) after being administered to the mammal. 



prognosis. 



36. The method of claim 34, wherein the cancer is due to the presence of (ii) and 
wherein an inhibitor of (ii) is provided to the mammal by administering to the mammal a 
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small molecule or an antibody specific for (ii), wherein the small molecule or antibody 
inhibits (ii) after being administered to the mammal. 

37. A composition comprising an inhibitor of BORIS and a carrier. 

38. The composition of claim 37, wherein the inhibitor of BORIS is a small 
molecule, and wherein the small molecule is present in the composition in an amount 
sufficient to inhibit BORIS. 

39. The composition of claim 37, wherein the inhibitor of BORIS is an antibody, 
and wherein the antibody is present in the composition in an amount sufficient to inhibit 



40. The composition of claim 37, wherein the inhibitor of BORIS is an antisense 
molecule, and wherein the antisense molecule is present in the composition in an amount 
sufficient to inhibit BORIS. 



BORIS. 



41 . The composition of claim 37, wherein the inhibitor of BORIS is an ribozyme 
molecule, and wherein the ribozyme molecule is present in the composition in an amount 
sufficient to inhibit BORIS. 
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ACCCTCCACTCTCGCGCCAGCCCGGCGGCGGCCGGCTGTGGGCTGCAGCACGCGGTGCAC 

GAGGCAGAGCCACAAGCCAAAGACGGAGTGGGCCGAGCATTCCGGCCACGCCTTCCGCGG 

CCAAGTCATTATGGCAGCCACTGAGATCTCTGTCCTTTCTGAGCAATTCACCAAGATCAA 

AGAACTCGAGTTGATGCCGGAAAAAGGCCTGAAGGAGGAGGAAAAAGACGGAGTGTGCAG 

AGAGAAAGACCATCGGAGCCCTAGTGAGTTGGAGGCCGAGCGTACCTCTGGGGCCTTCCA 

GGACAGCGTCCTGGAGGAAGAAGTGGAGCTGGTGCTGGCCCCCTCGGAGGAGAGCGAGAA 

GTACATCCTGACCCTGCAGACGGTGCACTTCACTTCTGAAGCTGTGGAGTTGCAGGATAT 

GAGCTTGCTGAGCATACAGCAGCAAGAAGGGGTGCAGGTGGTGGTGCAACAGCCTGGCCC 

TGGGTTiGCTGTGGCTTGAGGAAGGGCCCCGGCAGAGCCTGCAGCAGTGTGTGGCCATTAG 

TATCCAGCAAGAGCTGTACTCCCCGCAAGAGATGGAGGTGTTGCAGTTCCACGCTCTAGA 

GGAGAATGTGATGGTGGCCAGTGAAGACAGTAAGTTAGCGGTGAGCCTGGCTGAAACTGC 

TGGACTGATCAAGCTCGAGGAAGAGCAGGAGAAGAACCAGTTATTGGCTGAAAGAACAAA 

GGAGCyiGCTCTTTTTTGTGGAAACAATGTCAGGAGATGAAAGAAGTGACGAAATTGTTCT 

CACAGTTTCAAATTCAAATGTGGAAGAAC^GAGGATCAACCTACa^GCTGGTCAAGCAGA 

TGCTGAAAAGGCCAAATCTACAAAAAATCAAAGAAAGACAAAGGGAGCAAAAGGAACCTT 

CCACTGTGATGTCTGCATGTTCACCTCTTCTAGAATGTCAAGTTTTAATCGTCATATGAA 

AACTCACACGAGTGAGAAGCCTCACCTGTGTCACCTCTGCCTGAAAACCTTCCGTACGGT 

CACTCTGCTGCGGAACCATGTTAACACCCACACAGGAACCAGGCCCTACAAGTGTAACGA 

CTGCAACATGGCATTTGTCACCAGTGGAGAACTCGTCCGACACAGGCGCTATAAACATAC 

TCATGAGAAACCCTTTAAATGTTCCATGTGCAAGTATGCCAGTGTGGAGGCAAGTAAATT 

GAAGCGCCATGTCCGATCCCACACTGGGGAGCG'CCCCTTTCAGTGTTGCCAGTGCAGCTA 

TGCCAGCAGAGATACCTACAAGCTGAAACGCCACATGAGAACGCACTCAGGTGAGAAGCC 

TTACGAATGCCACATCTGCCACACCCGCTTCACCCAGAGCGGGACCATGAAAATACATAT 

TCTGCAGAAACACGGCGAAAATGTCCCCAAATACCAGTGTCCCCATTGTGCCACCATCAT 

TGCACGGAAAAGCGACCTACGTGTGCATATGCGCAACTTGCATGCTTACAGCGCTGCAGA 

GCTGAAATGCCGCTACTGTTCTGCTGTCTTCCATGAACGCTATGCCCTCATTCAGCACCA 

GAAAACTCATAAGAATGAGAAGAGGTTCAAGTGCAAACACTGCAGTTATGCCTGCAAGCA 

GGAACGTCATATGACCGCTGACATTCGTACCCACACTGGAGAGAAACCATTCACCTGCCT 

TTCTTGCAATAAATGTTTCCGACAGAAGCAACTTCTAAACGCTCACTTCAGGAAATACCA 

CGATGCAAATTTCATCCCGACTGTTTACAAATGCTCCAAGTGTGGCAAAGGCTTTTCCCG 

CTGGATTAACCTGCACAGACATTCGGAGAAGTGTGGATCAGGGGAAGCAAAGTCGGCTGC 

TTCAGGAAAG6GAAGAAGAACAAGAAAGAGGAAGCAGACCATCCTGAAGGAAGCCACAAA 

GGGTCAGAAGGAAGCTGCGAAGGGATGGAAGGAAGCCGCGAACGGAGACGAAGCTGCTGC* 

TGAGGAGGCTTCCACCACGAAGGGAGAACAGTTCCCAGGAGAGATGTTTCCTGTCGCCTG 

CAGAGAAACCACAGCCAGAGTCAAAGAGGAAGTG6ATGAAGGCGTGACCTGTGAAATGCT 

CCTCAACACGATGGATAAGTGAGAGGGATTCGGGTTGCGTGTTCACTGCCCCCAATTCCT- 

AAAGCAAGTTAGAAGTTTTTAGCATTTAAGGTGTGAAATGCTCCTCAACACGATGGATAA 

GTGAGAGAQAGTCa.GGTTGCATGTTCACTGCCCCTAATTCCTAAAGCAAGTTAGAAATTT 

TTAGCATTTTCTTTGAAAOUVTTAAGTTCATGACyU^TGGATGACACAAGTT 

GTCTAGAATTGTTCTCCTGTTTGTAGCTGGATATTTCAAAGAAACATTGCAGGTATTTTA 

TAAAAGTTTTAAACCTTGAATGAGAGGGTAACACCTCAAACCTATGGATTCATTCACTTG 

ATATTGGCTUWSGt'GGCCCACAATGAGTGAGTAGTGATTTTTGGAT^^ 

AGACCAGCTAGTGCTTCCACAGTCTUVAGCTGGACATTTTTATGTTGCATTATATAC^ 

ATGATATTTCTAATAATATATGGTTTTAAACATTAAAGACAAATGTTTTTATACAAAT6A 

ATTTTCTACAAAATTTAAAGCTACCATAATGCTTTTAATTAGTTCTAAATTCAACCAAAA 

AATGTTTTACTCTTATAAAAAGGAAAACTGAGTAGGAAATGAAATACTAGATTAGACTAG 

AAAATAAGGAATAAATCGATTTTACTTTGGTATAGGAGCAAGGTTCACCTTTAGATTTTT 

GTATTCTCTTTTAATTATGCTCCTTGGCAGGTATGAAATTGCCCTGGTTACATTCCATTA 

TTGCTTATTAGTATTTCACTCCATAACCCTTTTTTCTGCTAAAACTACTCTTTTTATATT 

TGTAAAATAATTGGCAGAGTGAGAAGAAACATAAAATCAGATAAGGCAAATGTGTACCTG 

TAAGGAATTTGTACTTTTTCATAATGCCCAGTGATTAGTGAGTATTTCCCTTTTGCCAGT 

TGACAAGATTTTTCCACCCTCGAGCAGCGTGAGAGATGCCTCTTTAACACTTGAAATTCA 

TTTCTATCTGGATACAGAGGCAGATTTTTCTTCATTGCTTAGTTGAGCAGTTTGTTTTGC 

TGCCAACCTGTCTCCACCCCTGTATTTCAAGATCATTGATAAGCCCTAAATTCAAATTCT 

TAAGATATGGACCTTTTATTGAAAATATCAC AAGTTCAGAATC CC TATACAATGTGAATA 

TGTGGAAATAATTTCCCAGCAGGAAGAGCATTATATTCTCTTTGTACCAGCAAATTAATT 

TAACTCAACTCACATGAGATTTAAATTCTGTGGGCTGTAGTATGCCATCATTGTGACTGA 

ATTTGT6CAATGGTTTCTTAATTTTTTTACTGTTATTTAAAGATGTTTTACATAATTCAA 

TAAAATGAAATGACTTAAAATTGCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

A (SEQ ID N0:1) 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680' 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2860 
2940 
3000 
3060 
3120 
3180 
3260 
3300 
3360 
3420 
3480 
3540 
3541 



1/12 
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CCATTTTGTGCACCTTGATCAAAGCCCATGTCTACTAGGCCCCAGCACCTCTGCACCCCA 
TAAAGATTGCACGCTCTTTTTCCATCAGGGGTCGTCACCATGGCTGCCGCTGAGGTCCCT 
GTCCCTTCTGGGTACTTCACCCAGATCAAAGAGCAGAAGTTGAAGCCTGGAGACCTAGAG 
GAGGAGAAAGAGGAGGACGGGGTACAAAGAGTGGAAGCCCAGGAGGGAGTTGTCAAGGAG 
GTGGAGGCCGAGAACAGTTGCCTGCTTCTGGAGGCCAGGGCCCCGGTGGAGAGCGACAGG 
CGGATCCTGACCCTGCAAACGGTGCACCTGGAGTCCCAGGATGTGCACCTACAGGGGCTG 
GGATGGCTGAGCGTGCCACACTCTGAGGAGCTTTCAGGGACGGTACCAGAGGC6GAAGGC 
ATACTGCAGTTGCCATCCGTGCTGTGGCTCGACCCAGAGCCCCAGCTCAGCCTTCAGCAT 
TGCGTGACGGTCAGCATCCCGGAAGAGCTGTACCCACCAGAGGAGCTGCAGCGGATACAT 
TTTCACCTGCTGAGAGAGAATGTGCTAATGGCCGAGGAGAACCCAGAGTTAACACCAGAC 
TTGGACGAAAGCACAGCCCTGAAAAAGCCCGAAGAAGATGAAAAGGACCAGCTCCCGCCC 
CAGGGAGAGACAGACAAGAGAGAAGAGAGGTTGCTCCTTCTGGAAATGAAACCAAAAGAG 
GGAAAAGACGACGAAATTGTCCTGACCATTTCCCATCTAAGCCTCGAAGAACAGCAAGAT 
CCACCAGCGGCCAATCAGACAAGTGTGCCGGGAGCCAAAGCCGCAAAACCAAAACGGCGG 
AGGCAGACCAAGGGAAAGCCTCAQAGCTTTCAGTGTGACACCTGCCCGTTCACTTCCTCC 
AAGCTCTCAACTTTCAATCGTCACATCAAAATTCACAGCAATGAGAGGCCACACCTGTGT 
CACCTGTGCCTGAAGGCCTTCCGGACTGTCACTCTTCTTAGGAACCATGTGAACACCCAC 
ACAGGAACCAGGCCCCACAAGTGCAGGGACTGCGACATGGCGTTTGTCACCAGCGGAGAA 
CTCGTCCGGCACAGGCGTTACAAACACACTTATGAGAAGCCCTTCAAGTGCTCCCTGTGC 
AAGTACGCCAGCGTCGAGGCAAGCAAGATGAAGCGTCACATCCGCTCACACACGGGTGAG 
CGTCCCTTCCAGTGTTGCCAGTGTGCTTATGCCAGCAGGGACTCCTACAAGCTGAAGCGC 
CACATGAGGACACACTCAGGTGAGAAGCCGTATGAATGTCCCACCTGTCACGTCCGGTTC 
ACCCAGAGCGGGACCATGAAAATCCATATAGCACAGAAGCACGGAGAGAATGTGCCCAAA 
TACGAGTGTCCCCACTGTGCCACCATCATCGCGAGGAAGAGCGACCTGCGTGTCCATCTG 
CGTAACCTGCACAGCCAGAGCCCGGAGGAGATGAAGTGCCGATACTGTCCCGCTGGCTTC 
CATGAGCGCTATGCCCTCATTCAGCACCAGAGGACCCACAAGAACGAGAAGAAGTTCAAG 
TGCAAGCAGTGCGATTACGCGTGCAAGCAGGAGCGATGCTTGAAGGCGCACATGCGCATG 
CACACAGGAGAGAAGCCCTTCTCCTGCCTGGCCTGCAACAAGCACTTCCGACAGAAGCAG 
CTACTGACCGTGCACCTGAGGAAGTACCATGACCCGAACTTCGTCCCCAATCTGCACCTG 
TGCCTCAAGTGTGATAAACGTTTCTCCCGCTGGAGTAACCTGCAGAGACACAGAAAGAAG 
TGTGACCCGGAGCATGAGACGTTAGCCCCCAACAAGGACAGGAGACCAGTGACAAGGACA 
CAGGCCTCGGAGGGAGAAGCAGGACAC7VAGGAAGGGGAGCCTCAGTGCCCTGGGGAGCAG 
GCTCTGGGCcyVCCAAGGAGAAGCAGCGGGGAGCCAGAGCCCAGACCACGGCCTTACCTGC 
GAGATGATCTTTAACATGATGGATAAGTGATGGATAAGTGAGCAGTCGTGCCTCTCCGTG 
CAGTGGCCTCTGGGGGAAGAAACCAGTTAGAAATAAGTTCCCAGACACAGCACAGTGTTC 
TCAGAGTTTGAGATAGTGTGTAGAAATGTTTGAGAGAAGGGGAAAAAAACCCTGCAGCTA 
TTTCCAAAGACTTGAGTCAGAGCTCGAAGTGAAGGTGCACATATCTGGGCCCTAGCAGGT 
GCCCAGAATGAGTCAGGGACAGATTCTAGGTGATACTTATGTCCACGGGGGCTCAGACCA 
GTTAACGCCTTGGTGGTCAGAGCAGAAAATTTTTTGAGTTGTTGTACCCACCCTCAA 
(SEQ ID NO:3) 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

96 0 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
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1500 

1560 ■■I 

1620 }i 

1680 

1740 

1800 
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1920 

1980 ' ....... 

2040 > 

2100 
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2280 

2337 
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Fig. 2A 

MAATEISVLSEQFTKIKELELMPEKGLKEEEKDGVCREKDHRSPSELEAERTSGAFQDSV 60 

LEEEVELVLAPSEESEKYILTLQTVHFTSEAVELQDMSLLSIQQQEGVQVVVQQPGPGLL 12 0 

WLEEGPRQSLQQCVAISlQQELYSPQEMEVLQFHALEENVMVASEDSKLAVSLAETAGIil 18 0 

KLEEEQEKNQLLAERTKEQLFFVETMSGDERSDEIVLTVSNSNVEEQEDQPTAGQADAEK 240 

AKSTKNQRKTKGAKGTFHCDVCMFTSSRMSSFNRHMKTHTSEKPHLCHLCLKTFRTVTLL 3 00 

RNHVNTHTGTRPYKCNDCNMAFVTSGELVRHRRYKHTHEKPFKCSMCKYAS VE 360 

WSHTGERPFQCCQCSYASRDTYKLKRlIMRTHSGEKPYECHICHTRFTQSGTMKIHIIiQK 42 0 

HGEWPKYQCPHCATI 1 ARKSDI^VHMRNLHAYSAAELKCRYCSAVFHERYALIQHQKTH 4 80 

KNEKRPKCKHCSYACKQERHMTAHIRTHTGEKPFTCLSCNKCFRQKQLLNAHFRKYHDAN 540 

FIPTVYKCSKCGKGFSRWINLHRHSEKCGSGEAKSAASGKGRRTRKRKQTILKEATKGQK 600 

EAAKGWKEAANGDEAAAiSEASTTKGEQFPGEMFPVACRETTARVKEEVDEGVTCEMLLNT 660 

MDK <SEQ ID NO: 2) 663 
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Fig . 2B 

MAAAEVPVPSGYFTQIKEQKLKPGDliEEEKEEDGVQRVBAQEGVVKEVEAENS 60 

APVESDRRILTLQTVHI^SQDVHLQGLGWLSVPHSEELSGTVPEAEGILQLPSVLWIJDPE 120 

PQLSLQHCVTVSIPEELYPPEELQRIHFHLLRENVLMAEENPELTPDLDESTALKKPEED 180 

EKDQLPPQGETDKI^ERIJJLLEMKPKEGKI}DEIVLTISHLSLEEQQDPPAANQTSVPGAK 240 

AAKPKIUiRQTKGKPQSFQCDTCPFTSSKLSTFNRHIKIHSNERPHLCHLCLKAFRTVTLL 300 

R^^^VNTHTGTRPHKCRDCDMAFVTSGELVRHRRYKHTTO 360 

IRSHTGERPFQCCQCAYASRDSYKLKIUIMRTHSGEKPYECPTCHVRFTQSGTMKIHIAQK 420 

HGENVPKYECPHCATIIARKSDLRVHLRNIJiSQSPEEMKCRYCPAGFHERYAI.IQHQRTH 480 

KiraKKFKCKQCDYACKQERCIiKAHMRMHTGEKPFSCLACNKHFRQKQLLTVHLRKYHDPN 54 0 

PVPNUILCLKCDKRFSRWSNIjQRHRKKCDPEHETLAPNKDRRPV^ 600 

PQCPGEQALGHQGEAAGSQSPDHGIiTCEMIFNMMDK (SEQ ID NO: 4) 636 
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Fig. 3 A 

CAGGGTAAAGCAGGGGCCCTGCCAGGCCTCCGAGGGAGTGTGCTTGGTCTGGCCGAGGGC 6 0 

TGCTTGGCCAAGTCTGGGTGGGCTCGAGGCCACTAGGCCCAAAGCCTGCCTGGCTCTGAG 120 
GGTGCTAGGTCTAGAACCGTGCACGAGGGGAATGCCTGCTCGGGCCCGAACCTCGCTGGG . 180 

CGCCGGGTGTGCACTGGCCCGGGGCCTGCTTGGACCTGAAACTTGCTAGGCCCAGGATAT 240 

GCACTGGCCGAGAGCCTGCTGGGCCCAAACCTTACTAGGCCCAGGATGTTCACTGACTGA 300 

ACCGGCTCAGGCCTAACCTTGCTAGGCCCAGGATATGCACTGGGCCAGAGTGTGCTCAGG 360 

CGGAACCTTGCCAGGC6CAGGATGTGT6CTGGCCCTAAGCCTGCTGAGGCCCAAACCTGT 420 

TCGTTCTAGGGTTTTGTACAAAATCCTGCTTTAGCCTAAATCCTGCTTAGCCTTGACCCC 480 

CTCCTAGACCCAAGCCAGATCAGCATTGTTCTGACCCTACTAAGTCCAAAACCTTTTGAG 540 

GCCAGACCTTGTTTCAACTCCAAAGCCTGCTAGGTTCCAGCACCCCCCGCATCCCTCCTC 600 

ATACCyVCCCCCTTCTCCCCCCTATOGAAACCGCTTGCTTATTTTTOUUlCAGGCCT^ 66 O 

ATTatggcagccactgagatctctgtcctttctgagcaattcaccaagatcaaagaactc 720 

1 MAATEI SVLSEQFTKIKEL 

gagttgatgccggaaaaaggcctgaaggaggaggaaaaagacggagtgtgcagagagaaa 780 

20 EIiMPEKGLKEEEKDGVCREK 

gaccatcggagccctagtgagttggaggccgagcgtacctctggggccttccaggacagc 840 

40 DHRSPSELEAERTSGAFQDS 

gtcctggaggaagaagtggagctggtgctggccccctcggaggagagcgagaagtacatc 900 

60 VLEEEVELVLAPSEESEKYI 

ctgaccctgcagacggtgcacttcacttctgaagctgtggagttgcaggatatgagcttg 960 

80 LTLQTVHFTSE AVELQ DMSL 

ctgagcatacagcagcaagaaggggtgcaggtggtggtgcaacagcctggccctgggttg 1020 

100 LS IQQQEGVQVVVQQPGPGIj 

ctgtggcttgaggaagggccccggcagagcctgcagcagtgtgtggccattagtatccag 1080 

120 LWLEEGPRQS LQQCVAIS IQ 

caagagctgtactccccgcaagagatggaggtgttgcagttccacgctctagaggagaat 1140 

140 QELY S PQEMEVLQFHALE EN 

gtgatggtggccagtgaagacagtaagttagcggtgagcctggctgaaactgctggactg . 1200 

160 VMVAS ED S K L A V . S LAE TA G L 

at caagc tcgaggaagagcaggagaagaaccagt tat tggctgaaagaacaaaggagcag 1260 

180 IKLEEBQEKNQLLAERTKE Q 

ctcttttttgtggaaacaatgtcaggagatgaaagaagtgacgaaattgttctcacagtt 1320 

200 LFPVETMSGDERSDEIVLTV 

tcaaattcaaatgtggaagaacaagaggatcaacctacagctggtcaagcagatgctgaa 1380 

220 SNSKVEEQE DQPTAGQADAE 

aaggccaaatctacaaaaaatcaaagaaagacaaagggagcaaaaggaaccttccactgt 1440 

240 KAKSTKNQRKTKGAKGTFHC 

gatgtctgcatgttcacctcttctagaatgtcaagttttaatcgtcatatgaaaactcac 1500 
260 DVCMFTS SRM S S FNRHMKT H 

accagtgagaagcctcacctgtgtcacctctgcctgaaaaccttccgtacggtcactctg 1560 
280 TSEKPHLCH IiCLK TFRTVTL 

ctgcggaaccatgttaacacccacacaggaaccaggccctacaagtgtaacgactgcaac 1620 
300 LRNHVNTHTGTRPYKCNDCN 



atggcatttgtcaccagtggagaactcgtccgacacaggcgctataaacatactcatgag 1680 
320 MAFVTSGELVRHRRYKHTHE 

aaaccctttaaatgttccatgtgcaagtatgccagtgtggaggcaagtaaattgaagcgc 1740 
340 KPFKCSMCKYASVEAS KLKR 

S3oa = s»ass»=iss>an=: = =:== ZF4 = = = = = =: = = J3==s=5 = = =s = = == = = 

catgtccgatcccacactggggagcgcccctttcagtgttgccagtgcagctatgccagc 1800 
360 HVRSHTGERPFQCCQCSYAS 

agagatacctacaagctgaaacgccaca tgagaacgcactcaggtgagaagcc t tacgaa 1860 
380 RDTYKLKRHMRTHSGEKPYE 

tgccacat ctgccacacccgc 1 1 cacccagagcgggaccatgaaaatacatat t ctgcag 1920 
400 CHICHTRPTQSGTMKIHILQ 



5/12 



wo 03/072799 PCT/US03/05186 

Pig. 3 A (cont.if 



aaacacggcgaaaatgtccccaaataccagtgtccccattgtgccaccatcattgcacgg , 1980 
420 KHGENVPKYQC PHCATIIAR 

aaaagcgacctacgtgtgcatatgcgcaacttgca tgc t tacagcgctgcagagctgaaa 2040 
440 KSDLRVHMRNLHAYSAAELK 

sssaacs ZF7 CJ=3=3=3C3r3r3=! = ia=acaca==a=3 = = ==3 = c= = =i 

tgccgctactgttctgctgtcttccatgaacgctatgccctcattcagcaccagaaaact 2100 
460 CRYCSAVPHERYALIQHQK T 

cataagaatgagaagaggtt caagtgcaaacactgcagt ta tgcctgceiagcaggaacgt 2 16 0 
480 HKNEKRF KCKHCSYACKQER 

s =r = = =s = = = = = = = ==2 = = = = = ==== ZF9 === = = = = = 

catatgaccgctcacattcgtacccacactggagagaaaccattcacctgcctttcttgc 2220 
500 HMTAHIR THTGEKPFTCLSC 

aataaatgtttccgacagaagcaacttctaaacgctcacttcaggaaataccacgatgca 2280 
520 NKCFRQKQLIiNAHFRKYHDA 

s==sss = s3== = =:=5 ZFIO = = = = — = = = = = = = =: = = = = = = = = = = = = = = = = = =: = == = = 

aatttcatcccgactgtttacaaatgctccaagtgtggcaaaggcttttcccgctggatt 2340 
540 NPIPTVYKCSKCGKGFSRWI 

= «=3=s=s= = ss = = = e3s= = = ====s ZFll s=sa = cs=s=»=sssw 

aacctgcacagacattcggagaagtgtggatcaggggaagcaaagtcggctgcttcagga 2400 
560 NLHRHS E K C G S G EA KS AA S G 

aagggaagaagaacaagaaagaggaagcagaccatcctgaaggaagccacaaagggtcag 2460 
580 KGRRTRKR.KQTILKEATKGQ 

aaggaagctgcgaagggatggaaggaagccgcgaacggagacgaagctgctgctgaggag 2520 
600 KEAA.KGWKEAANGDEAAAEE 

gcttccaccacgaagggagaacagttcccaggagagatgtttcctgtcgcctgcagagaa 2580 
620 ASTTK6EQFPGEMFPVACRE 

accacagccagagtcaaagaggaagtggatgaaggcgtgacctgtgaaatgctcctcaac 2640 
640 TTARVKEEVDEGVTCEMLLN 

acga tggat aagTGAGAGQQATTCGGGTTGCGTGTTCACTGCCCCCAATTCCTAAAGCAA 2700 
660 T M D K 

GTTAGAAGTTTTTAGCATTTAAGGTGTGAAAT6CTCCTCAACACGATGGATAAGTGAGAG 2760 

AGAGTCAGGTTGCATGTTCACTGCCCCTAATTCCTAAAGCAAGTTAGAAATTTTTAGCAT 2820 

TTTCTTTGAAACAATTAAGTTaiTGACTVATGGATGACACAAGTTT 2880 

ATTGTTCTCCrGTTTGTAGCTGGATATTTCAAAGAAACATTGCAGGTATT^ 2 94 0 

TTTAAACCTTGAATGAGAGGGTAACACCTCUU^CCTATGGATTCATTCACTTGATAT^^ 3000 

CAAGGTGGCCOlCAATGAGTGAGTAGTGATTTTTGGATATTTCAAAATAGTCrAGACCAG 3060 

CTAGTGCTTCCACAGTCAAAGCTGGACATTTTTATGTTGCATTATATAavCCCATGATAT 3 12 0 

TTCTAATAATATATGGTTTTAAACATTAAAGACAAATGTTTTTATACMATGAATTCT 3180 

ACAAAATTTAAAGCTACCATAATGCITTTAATTAGTTCTAAATTCAACCAAA;^^ 3240 

TACTCTTATAAAAAGGAAAACTGAGTAGGAAATGAAATACTAGATTAGACTAGAAAATAA 3300 

GGAATAAATCGATTTTACTTTGGTATAGGAGCAAGGTTCACCTTTAGATTTTTGTATTCT 3360 

(nTTTAATTATGCTCCTTGGCAGGTATGAAATTGCCCTGGTTACATTCCy^TTATTGCTTA 3420 

TTAGTATTTCACTCCATAACCCTTTTTTCTGCTAAAACTACTCTTTTTATATTTGTAAAA 34 80 

TAATTGGCAGAGTGAGAAGAAACATAAAATCAGATAAGGCAAATGTGTACCTGTAAGGAA 3540 

TTTGTACTTTTTCATAATGCCCAGTGATTAGTGAGTATTTCCCTTTTGCCAGTTGACAAG 3600 

ATTTTTCCACCCrCGAGCAGCGTGAGAGATGCCTCTTTAACACTTGAAATTCATTTCTAT 3660 

CTGGATACAGAGGCAGATTTTTCTTCATTGCTTAGTTGAGCAGTTTGTTTTGCTGCCAAC 3720 

CTGTCTCCACCCCrGTATTTCAAGATCATTGATAAGCCCTAAATTCAAATTCTTAAGATA 3780 

TGGACCTTTTATTGAAAATATCACAAGTTCAGAATCCCTATACAATGTGAATATGTGGAA 3840 

ATAATTTCCCAGCAGGAAGAGCATTATATTCTCTTTGTACCAGCAAATTAATTTAACTCA 3900 

ACTCACATGAGATTTAAATTCTGTGGGCTGTAGTATGCCATCATTGTGACTGA^ 3 960 

CJUVTGGTTTCTTAATTTTTTTACTGTTATTTAAAGATGTTTTACATAATTCAATAAAATG 4020 

AAATGACTTAAAATTGCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 4080 
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Fig. 3B 



PCT/US03/05186 



MAATEIS-VLSEQFTKIKELEIiMPEKQLKEEEKDGVCREKDHRSPSELEAERTSG 54 

MEGDAVEAIVEESETFIKGKERKTYQRRREGGQEEDACHLPQ NQTDG 4 7 

AFQDS VliEE EV- ELVLAPSEES E KYILTLQTVHFT 127 

GEWQDVNSSVQMVMMEQIJDPTLLQMKTEVMEGTOAPEAEAAVDDTQIITLQVVN^ 104 

SEAV ELQDMSLLSIQQQEGVQVWQQPGPGLLWLEEGPRQSLQQCVAISIQQELYSPQ 145 

EQPINIGELQ LVQVPVPVTVP - VATTSVEE LQGAYENEVSKEGLAES 150 

EMEVLQFHALEE- - JTVMVASEDSKIiAVSIAETAGLIKLEEEQEKN QLLAERTKEQLFFVE 163 

- - EPMI CHTLPLPEGFQVVKVGANGEVETLEQGELPPQBDPSWQKDPDYQPPAKKTKKTKKSKL 212 



TMSGDERSDBIVLTVSNSNVEEQEDQPTAGQADAEKA KSTKNQRKTKGAKGT 

RYTEEGKD VDVSVYDFEEE(K3EGLLSEVNAEKWGNMKPPKPTOIKKKGVKICr 



ERPHK^^^ 

;kpfk 




TGTRP 




^GiKPYE S^gE^mgL^ 

KR 




|gS GEAKSAASGKGRRTRKRKQTILKEATKGQKE 




G PDGVEGENGGETKKS KRGRKRKMR S KKEDS SDSEN 

AAKGWKEAANGDEAAAEEASTTKGEQFPGEMFPVACRETTAR 

AEPDL--DDNEDEEEPAVEIEPEPEPQPVTPAPPPAiCKRRGRPPGRTNQPKQNQP 

VKEEVPEGVTCEMLT..NTMDK 

TAI IQVEDQNTGAIENI IVEVKKEPDAEPAEGEEEEAQPAATDAPNGDIiTPEMILSMMDR 



256 
265 
312 

321 
369 

378 
427 

436 
485 

494 
545 

554 
601 

614 
643 
667 
663 
727 
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mass U 



Ieq^i^^leImpekg: 



^scllSear 
tsga|qds 



— -^-p 

VLEEEVELVL^SBI 




60 
60 

120 
109 

180 
167 

239 
221 

297 
277 

354 
334 

410 
390 

470 
450 

526 
506 

581 
562 

615 
615 

647 
654 
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Pig. 3D 

CCATTTTGTGCACCTTGATCAAAGCCCATGTCTACTAGGCCCCAGCACCTCTGCACCCCA 6 0 
TAAAGATTGCACGCTCn^TTTCCATCAGGGGTCGTCACCatggctgccgctgaggtccct 120 
1 MAAAEVP 

gtcccttctgggtacttcacccagatcaaagagcagaagttgaagcctggagacctagag 180 
8VPSG YFTQIKEQK LKPGD.LE 

gaggagaaagaggaggacggggt acaaagag tggaagcccaggagggag t tgt caaggag 240 
28E£KEEOGVQRVEAQE6VV 'KE 

gtggaggccgagaacagttgcctgcttctggaggccagggccccggtggagagcgacagg 300 
48VEAENSCLnL EAR. APVESDR 

cggatcctgaccctgcaaacggtgcacctggagtcccaggatgtgcacctacaggggctg .360 
68RILTLQTVHLESQDVHLQGL 

ggatggctgagcgtgccacactctgaggagctttcagggacggtaccagaggcggaaggc 420 
88 6WLSVPKSEELS GTVPEAEG 

atactgcagttgccatccgtgctgtggctcgacccagagccccagctcagccttcagcat 480 
108 ILQLPSVLWLDPEPQLSLQH 

tgcgtgacggtcagcatcccggaagagctgtacccaccagaggagctgcagcggatacat 540 
128 CVTVSIPEELYPPEELQRIH 

tttcacctgctgagagagaatgtgctaatggccgaggagaacccagagttaacaccagac 600 
148 FHLL RENVLMAEENPELTPD 

ttggacgaaagcacagccctgaaaaagcccgaagaagatgaaaaggaccagctcccgccc 660 
168 LDESTALKKP'EEDEKDQLPP 

cagggagagacagacaagagagaagagaggttgctccttctggaaatgaaaccaaaagag 720 
188 QGETDKREERIililiLEMKPKE 

ggaaaagacgacgaaattgtcctgaccatttcccatctaagcctcgaagaacagcaagat 780 
208 GKDDEIVLTISHLSIiEEQQD 

ccaccagcggccaatcagacaagtgtgccgggagccaaagccgcaaaaccaaaacggcgg 640 
228 PPAANQTSVPGAKAAKPKRR 

aggcagaccaagggaaagcctcagagctttcagtgtgacacctgcccgttcacttcctcc 900 
248 RQTKGKPQSFQCDTCPFTSS 

aagctctcaactttcaatcgtcacatcaaaattcacagcaatgagaggccacacctgtgt: 960 
268 KLSTFNRHIKIHSNERPHIiC 



cacc tg tgc c tgaaggcc 1 1 ccggac tg t cac t c 1 1 c t taggaacca tg tgaacacccac 1020 
288 HLCIiKAFRTVTLLRNHVNTH 

acaggaaccaggccccacaagtgcagggactgcgacatggcgt t tgtcaccagcggagaa 1080 
308 TGTRPHKCRD.CDMAFVTSGE 

ctcgtccggcacaggcgttacaaacacacttatgagaagcccttcaagtgctccctgtgc 1140 
328 liVRHRRYKHTYEKPFKCSLC 



aagtacgccagcgt cgaggcaagcaaga tgaagcgtcacat ccgctcacacacgggtgag 1200 
348 KYAS VEAS KMKRHIRSHTGE 

S9=:=: = =: = =s=3ssss=sasZF43Bas:=3S3SS=3=3=r===s = = =:ss = z3ss = = s=s&sasass 

cgtcccttccagtgttgccagtgtgcttatgccagcagggactcctacaagctgaagcgc 1260 
368 RPFQCCQCAYASRDSYKLKR 

ss = ssBS8S8ait3S3 = = as=s = sssssasZF5sssa=aaaas3aasBS3S3B 

cacatgaggacacactcaggtgagaagccgtatgaatgtcccacctgtcacgtccggttc 1320 
388 HMRTHSGEKPYECPTCHVRP 

acccagagcgggaccatgaaaatccatatagcacagaagcacggagagaatgtgcccaaa 1380 
408 TQSGTMKIHIAQKHGBNVPK 

tacgagtgtccccactgtgccaccatcatcgcgaggaagagcgacctgcgCgtccatctg 1440 
428 YECPHCATI lARKSDLRVHL 

ssass8aaaass = assBSSasBassZF7 = = ==== — ==:^ = a=>Bsss=3S = = = = =3 

cgtaacctgcacagccagagcccggaggagatgaagtgccgatactgtcccgctggcttc 1500 
448 RNLHSQSPBEMKCRYCPAGF 

saasasssacsa sssssssssssassassssaasss: 

catgagcgctatgccctcattcagcaccagaggacccacaagaacgagaagaagttcaag 1560 
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Fig. 3D 

468 HERYALIQHQRTHKNEKKFK 

B aa = = Z F8 3 3 = = = =:= = = = = =3 =s s) s B 3 a = a a £3 = 8 s a s= a = 

tgcaagcag tgcgat tacgcg tgcaagcaggagcga tgc t tgaaggcgcaca tgcgca tg 1620 
488 CKQCDYACKQERCLKAHMRM 

zi = = sssssss = sS3Sssss=sBsa8as3ZF9 = 3asaassss8aasasaas=sss = ssii30=; 

cacacaggagagaagcccttctcctgcctggcctgcaacaagcacttccgacagaagcag 1680 
508 H TGEKPFSCLACNKHFRQKQ 

ctactgaccgtgcacctgaggaagtaccatgacccgaacttcgtccccaatctgcacctg 1740 
528 LLTVHLRKYHDPNF VPNLHL 

aaaasBaBsascsassssssaaacBaBSsas 

tgcctcaagtgtgataaacgtttctcccgctggagtaacctgcagagacacagaaagaag 1800 
548 CLKCOKRFSRWSNLQRHRKK 

tgtgacGcggagcatgagacgttagcccccaacaaggacaggagaccagtgaoaaggaca 1860 
568 CDPEHETLAPNKDRRPVTRT 

caggcctcggagggagaagcaggacacaaggaaggggagcctcagtgccctggggagcag 1920 
588 Q ASEGEAGHKEGEPQ CPGEQ 

gctctgggccaccaaggagaagcagcggggagccagagcccagaccacggccttacctgc 1980 
608. A LGHQGEAAG SQS PDH GLTC 

gagatgatctt taacatgatggataagTGATGGATAAGTGAGCAGTCGTGCCTCTCCGTG 2040 
628 EMIFNMMDK 

CAGTGGCCTCTGGGGGAAGAAACCAGTTAGAAATAAGTTCCCAGACACAGCACAGTGTTC 210 0 
TCAGAGTTTGAGATAGTGTGTAGAAATGTTTGAGAGAAGGGGAAAAAAACCCTGCAGCTA 2160 
TTTCCAAAGACTTGAGTCAGAGCTCGAAGTGAAGGTGCACATATCTGGGCCCTAGCAGGT 2220 
GCCCAGAATGAGTCAGGGACAGATTCTAGGTGATACTTATGTCCACGGGGGCTCAGACCA 2280 
GTTAACGCCTTGGTGGTCAGAGCAGAAAATTTTTTGAGTTGTTGTACCCACCCTCAA 2340 
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Pig, 4A 

ForNl GA6CCTGTGGAGCGATTAAACC 

RevNl CCGCCGCCGCTCCAC 

ForN2 CTTOTTTGGCGGCAGCGGCG 

RevN2 CGCGCCACACCCCCCGC 

PorN3 CCCCAGAACCAGAC 

RevN3 ACTTCAGTCTTCATCTG 

ForZPl TGTGAGCTTTGCAGTTACAC 

RevZFl ACTGTTCTGAATGCCCTG 

ForZF2 CGGCGTTCAAATTTGG 

RevZF2 CGAGTACCTGTGTGTQTGTT 

ForZP3 GTGCCCAGACTGCGA 

RevZF3 AATCGCACATGGAACAC 

ForZF4 TTCAAGTGTTCCATGTG 

RevZF4 CTGCTGGCATAACTGCAC 

ForZFS CACATACAAGCTGAAAAGG 

RevZFS GCATCTTCATGGTACCAC 

ForZFS GTCATAGCCCGAAAAAGTG 

RevZFS CGCTCATGAAACACAGC 

ForZFV GTGTGACCAGTGTGATTA 

RevZF? TTCTGGCGGAAGGTCTT 

ForZFS CAAGCGCTATCACGACC 

RevZFS TCTGCAT6TCTTGCCAT 

ForCl TCCTCTQACAGTGAAAATGC 

RevCl CACAGGCTOAGGCTCTGG 

ForC2 . CAGAATACAGGTGCAATTG 

RevC2 CACCGGTCCATCATGCTG 

NEWTCFOR GCCAGT6TGGAGGCAAGTAAATTGAAG 

NEWTCREV CACTGGCAACACTGAAAGGGGCGCTCCCC 
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Fig. 4B 

MBIFOR TCGTCATATGAAAACTCACACC (SEQ ID NO: 34) 

MBIREV GACGAGTTCTCCACTGGTG (SEQ ID NO: 35) 

MB2FOR AACATACTCATGA.GAAACCC (SEQ ID NO: 36) 

MB2REV GAGTGCGTTCTCATGTGG (SEQ ID NO: 37) 

MB3FOR GAGCGCCCCTTTCAGTGT (SEQ ID NO: 38) 

MB3REV GCACAAT6GGGACAC (SEQ ID NO: 39) 

MB4FOR ACCCAGAGCGGGACCATGAAA (SEQ ID NO: 40) 

MB4REV GACAGCAGAACAGTAGCGG (SEQ ID NO: 41) 

MB5FOR CATAAGAATGAGAAGA6G (SEQ ID NO: 42) 

MB5REV AAGTTGCTTCTGTCGGAAA. (SEQ ID NO: 43) 

MBNEWFOR TTGTGCAGTTATGCCAGCAGG'^ (SEQ ID NO: 44) 

MBNEWREV GTGCTTCTGTAAAATGTGCATC (SEQ ID NO: 45) 
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SEQUENCE LISTING 

<110> GOVERNMENT OF THE UNITED STATES OF AMERICA, REPRESNTED BY THE SECRETARY, 
DEPARTMENT OF HEALTH AND HUMAN SERVICES 

LOBANENKOV, VICTOR V 

LOUKINOV, DMITRI I 

MORSE. HERBERT C 

<120> BROTHER OF THE REGULATOR OF IMPRINTED SITES (BORIS) 

<130> 221324 

<150> US 60/358,889 
<151> 2002-02-22 

<160> 49 

<170> Patentin version 3.1 

<210> 1 

<211> 3541 

<212> DNA 

<213> Homo sapiens 



<400> 1 
accctccact 


ctcgcgccag 


cccggcggcg 


gccggctgtg 


ggctgcagca 


cgcggtgcac 


60 


gaggcagagc 


cacaagccaa 


agacggagtg 


ggccgagcat 


tccggccacg 


ccttccgcgg 


120 


ccaagtcatt 


atggcagcca 


ctgagatctc 


tgtcctttct 


gagcaattca 


ccaagatcaa 


180 


agaactcgag 


ttgatgccgg 


aaaaaggcct 


gaaggaggag 


gaaaaagacg 


gagtgtgcag 


240 


agagaaagac 


catcggagcc 


ctagtgagtt 


ggaggccgag 


cgtacctctg 


gggccttcca 


300 


ggacagcgtc 


ctggaggaag 


aagtggagct 


ggtgctggcc 


ccctcggagg 


agagcgagaa 


360 


gtacatcctg 


accctgcaga 


cggtgcactt 


cacttctgaa 


gctgtggagt 


tgcaggatat 


420 


gagcttgctg 


agcatacagc 


agcaagaagg 


ggtgcaggtg 


gtggtgcaac 


agcctggccc 


480 


tgggttgctg 


tggcttgagg 


aagggccccg 


gcagagcctg 


cagcagtgtg 


tggccattag 


540 


tatccagcaa 


gagctgtact 


ccccgcaaga 


gatggaggtg 


ttgcagttcc 


acgctctaga 


600 


ggagaatgtg 


atggtggcca 


gtgaagacag 


taagttagcg 


gtgagcctgg 


ctgaaactgc 


660 


tggactgatc 


aagctcgagg 


aagagcagga 


gaagaaccag 


ttattggctg 


aaagaacaaa 


720 


ggagcagctc 


ttttttgtgg 


aaacaatgtc 


aggagatgaa 


agaagtgacg 


aaattgttct 


780 


cacagtttca 


aattcaaatg 


tggaagaaca 


agaggatcaa 


cctacagctg 


gtcaagcaga 


840 


tgctgaaaag 


gccaaatcta 


caaaaaatca 


aagaaagaca 


aagggagcaa 


aaggaacctt 


900 


ccactgtgat 


gtctgcatgt 


tcacctcttc 


tagaatgtca 


agttttaatc 


gtcatatgaa 


960 


aactcacacc 


agtgagaagc 


ctcacctgtg 


tcacctctgc 


ctgaaaacct 


tccgtacggt 


1020 


cactctgctg 


cggaaccatg 


ttaacaccca 


cacaggaacc 


aggccctaca 


agtgtaacga 


1080 


ctgcaacatg 


gcatttgtca 


ccagtggaga 


actcgtccga 


cacaggcgct 


ataaacatac 


1140 
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tcatgagaaa ccctttaaat gttccatgtg caagtatgcc agtgtggagg caagtaaatt 1200 

gaagcgccat gtccgatccc acactgggga gcgccccttt cagtgttgcc agtgcagcta 1260 

tgccagcaga gatacctaca agctgaaacg ccacatgaga acgcactcag gtgagaagcc 1320 

ttacgaatgc cacatctgcc acacccgctt cacccagagc gggaccatga aaatacatat 1380 

tctgcagaaa cacggcgaaa atgtccccaa ataccagtgt ccccattgtg ccaccatcat 1440 

tgcacggaaa agcgacctac gtgtgcatat gcgcaacttg catgcttaca gcgctgcaga 1500 

gctgaaatgc cgctactgtt ctgctgtctt ccatgaacgc tatgccctca ttcagcacca 1560 

gaaaactcat aagaatgaga agaggttcaa gtgcaaacac tgcagttatg cctgcaagca 1620 

ggaacgtcat atgaccgctc acattcgtac ccacactgga gagaaaccat tcacctgcct 1680 

ttcttgcaat aaatgtttcc gacagaagca acttctaaac gctcacttca ggaaatacca 1740 

cgatgcaaat ttcatcccga ctgtttacaa atgctccaag tgtggcaaag gcttttcccg 1800 

ctggattaac ctgcacagac attcggagaa gtgtggatca ggggaagcaa agtcggctgc 1860 

ttcaggaaag ggaagaagaa caagaaagag gaagcagacc atcctgaagg aagccacaaa 1920 

gggtcagaag gaagctgcga agggatggaa ggaagccgcg aacggagacg aagctgctgc 1980 

tgaggaggct tccaccacga agggagaaca gttcccagga gagatgtttc ctgtcgcctg 2040 

cagagaaacc acagccagag tcaaagagga agtggatgaa ggcgtgacct gtgaaatgct 2100 

cctcaacacg atggataagt gagagggatt cgggttgcgt gttcactgcc cccaattcct 2160 

aaagcaagtt agaagttttt agcatttaag gtgtgaaatg ctcctcaaca cgatggataa 2220 

gtgagagaga gtcaggttgc atgttcactg cccctaattc ctaaagcaag ttagaaattt 2280 

ttagcatttt ctttgaaaca attaagttca tgacaatgga tgacacaagt ttgaggtagt 2340 

gtctagaatt gttctcctgt ttgtagctgg atatttcaaa gaaacattgc aggtatttta 2400 

taaaagtttt aaaccttgaa tgagagggta acacctcaaa cctatggatt cattcacttg 2460 

atattggcaa ggtggcccac aatgagtgag tagtgatttt tggatatttc aaaatagtct 2520 

agaccagcta gtgcttccac agtcaaagct ggacattttt atgttgcatt atatacaccc 2580 

atgatatttc taataatata tggttttaaa cattaaagac aaatgttttt atacaaatga 2640 

attttctaca aaatttaaag ctaccataat gcttttaatt agttctaaat tcaaccaaaa 2700 

aatgttttac tcttataaaa aggaaaactg agtaggaaat gaaatactag attagactag 2760 

aaaataagga ataaatcgat tttactttgg tataggagca aggttcacct ttagattttt 2820 

gtattctctt ttaattatgc tccttggcag gtatgaaatt gccctggtta cattccatta 2880 

ttgcttatta gtatttcact ccataaccct tttttctgct aaaactactc tttttatatt 2940 

tgtaaaataa ttggcagagt gagaagaaac ataaaatcag ataaggcaaa tgtgtacctg 3000 

taaggaattt gtactttttc ataatgccca gtgattagtg agtatttccc ttttgccagt 3060 
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t"i"i" rraccct 


cgagcagcgt 


gagagatgcc 


tctttaacac 


ttgaaattca 


3120 


tttctatctg 


gatacagagg 


cagatttttc 


ttcattgctt 


agttgagcag 


tttgttttgc 




tgccaacctg 


tctccacccc 


tgtatttcaa 


gatcattgat 


aagccctaaa 


ttcaaattct 


3240 


taagatatgg 


accttttatt 


gaaaatatca caagttcaga 


atccctatac 


aatgtgaata 


3300 


tgtggaaata 


atttcccagc 


aggaagagca 


ttatattctc 


tttgtaccag 


caaattaatt 


3360 


taactcaact 


cacatgagat 


ttaaattctg tgggctgtag 


tatgccatca ttgtgactga 


3420 


atttgtgcaa 


tggtttctta 


atttttttac 


tgttatttaa 


agatgtttta 


cataattcaa 


3480 


taaaatgaaa 


tgacttaaaa 


ttgcaaaaaa 


aaaaaaaaaa 


aaaaaaaaaa 


aaaaaaaaaa 


3540 


a 












3541 



<210> 2 

<211> 663 

<212> PRT 

<213> Homo sapiens 

<400> 2 



Met Ala Ala Thr Glu He Ser val Leu Ser Glu Gin Phe Thr Lys lie 
15 10 15 

Lys Glu Leu Glu Leu Met Pro Glu Lys Gly Leu Lys Glu Glu Glu Lys 
20 25 30 

Asp Gly val Cys Arg Glu Lys Asp His Arg ser Pro Ser Glu Leu Glu 
35 40 45 ' ' 

Ala Glu Arg Thr Ser Gly Ala Phe Gin Asp Ser val Leu Glu Glu Glu 
50 55 60 

val Glu Leu val Leu Ala Pro ser Glu Glu Ser Glu Lys Tyr lie Leu 
65 70 75 80 

Thr Leu Gin Thr Val His Phe Thr Ser Glu Ala Val Glu Leu Gin Asp 
85 90 95 

Met Ser Leu Leu Ser He Gin Gin Gin Glu Gly val Gin Val val val 
100 105 110 

Gin Gin Pro Gly Pro Gly Leu Leu Trp Leu Glu Glu Gly Pro Arg Gin 
115 120 125 

Ser Leu Gin Gin Cys val Ala lie Ser lie Gin Gin Glu Leu Tyr ser 
130 135 140 
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Pro Gin Glu Met Glu val Leu Gin Phe His Ala Leu Glu Glu Asn val 
145 150 155 160 



Met val Ala ser Glu Asp Ser Lys Leu Ala val Ser Leu Ala Glu Thr 
165 170 175 



Ala Gly Leu lie Lys Leu Glu Glu Glu Gin Glu Lys Asn Gin Leu Leu 
180 185 190 



Ala Glu Arg Thr Lys Glu Gin Leu Phe Phe Val Glu Thr Met Ser Gly 
195 200 205 



Asp Glu Arg ser Asp Glu lie val Leu Thr val ser Asn Ser Asn val 
210 215 220 



Glu Glu Gin Glu Asp Gin Pro Thr Ala Gly Gin Ala Asp Ala Glu Lys 
225 230 235 240 



Ala Lys Ser Thr Lys Asn Gin Arg Lys Thr Lys Gly Ala Lys Gly Thr 
245 250 255 



Phe His Cys Asp Val Cys Met Phe Thr Ser Ser Arg Met Ser ser Phe 
260 265 270 



Asn Arg His Met Lys Thr His Thr ser Glu Lys pro His Leu Cys His 
275 280 285 



Leu Cys Leu Lys Thr Phe Arg Thr Val Thr Leu Leu Arg Asn His val 
290 295 300 



Asn Thr His Thr Gly Thr Arg Pro Tyr Lys Cys Asn Asp Cys Asn Met 
305 310 315 320 



Ala Phe val Thr Ser Gly Glu Leu Val Arg His Arg Arg Tyr Lys His 
325 330 335 



Thr His Glu Lys Pro Phe Lys cys Ser Met cys Lys Tyr Ala Ser val 
340 345 350 



Glu Ala Ser Lys Leu Lys Arg His val Arg ser His Thr Gly Glu Arg 
355 360 365 



Pro Phe Gin Cys Cys Gin Cys ser Tyr Ala Ser Arg Asp Thr Tyr Lys. 
370 375 380 



Leu Lys Arg His Met Arg Thr His Ser Gly Glu Lys Pro Tyr Glu Cys 



385 




395 



400 
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His lie cys His Thr Arg Phe Thr Gin Ser Gly Thr Met Lys lie His 
405 410 415 



He Leu Gin Lys His Gly Glu Asn val pro Lys Tyr Gin Cys Pro His 
420 425 430 



Cys Ala Thr lie lie Ala Arg Lys Ser Asp Leu Arg Val His Met Arg 
435 440 445 



Asn Leu His Ala Tyr Ser Ala Ala Glu Leu Lys Cys Arg Tyr Cys Ser 
450 455 460 



Ala val Phe His Glu Arg Tyr Ala Leu He Gin His Gin Lys Thr His 
465 470 475 480 



Lys Asn Glu Lys Arg Phe Lys Cys Lys His cys Ser Tyr Ala Cys Lys 
485 490 495 



Gin Glu Arg His Met Thr Ala His lie Arg Thr His Thr Gly Glu Lys 
500 505 510 



Pro Phe Thr Cys Leu Ser Cys Asn Lys Cys Phe Arg Gin Lys Gin Leu 
515 520 525 



Leu Asn Ala His Phe Arg Lys Tyr His Asp Ala Asn Phe lie Pro Thr 
530 535 540 



val Tyr Lys Cys ser Lys Cys Gly Lys Gly Phe Ser Arg Trp He Asn 
545 550 555 560 



Leu His Arg His Ser Glu Lys Cys Gly Ser Gly Glu Ala Lys Ser Ala 
565 570 575 



Ala ser Gly Lys Gly Arg Arg Thr Arg Lys Arg Lys Gin Thr He Leu 
580 585 590 



Lys Glu Ala Thr Lys Gly Gin Lys Glu Ala Ala Lys Gly Trp Lys Glu 
595 600 605 



Ala Ala Asn Gly Asp Glu Ala Ala Ala Glu Glu Ala Ser Thr Thr Lys 
610 615 620 



Gly Glu Gin Phe Pro Gly Glu Met Phe Pro val Ala Cys Arg Glu Thr 
625 630 635 640 



Thr Ala Arg val Lys Glu Glu val Asp Glu Gly val Thr Cys Glu Met 



645 



650 



655 
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Leu Leu Asn Thr Met Asp Lys 
660 

<210> 3 
<211> 2337 
<212> DNA 
<213> Mouse 



<400> 3 
ccattttgtg 


caccttgatc 


aaagcccatg 


tctactaggc cccagcacct ctgcacccca 


AO 


taaagattgc 


acgctctttt 


tccatcaggg 


gtcgtcacca 


tggctgccgc tgaggtccct 


xLkj 


gtcccttctg 


ggtacttcac 


ccagatcaaa 


gagcagaagt 


tgaagcctgg agacctagag 




gaggagaaag 


aggaggacgg 


ggtacaaaga 


gtggaagccc 


aggagggagt tgtcaaggag 




gtggaggccg 


agaacagttg 


cctgcttctg 


gaggccaggg 


ccccggtgga gagcgacagg 




cggatcctga 


ccctgcaaac 


ggtgcacctg 


gagtcccagg 


atgtgcacct acaggggctg 




ggatggctga 


gcgtgccaca 


ctctgaggag 


ctttcaggga cggtaccaga ggcggaaggc 




atactgcagt 


tgccatccgt 


gctgtggctc 


gacccagagc 


cccagctcag ccttcagcat 




tgcgtgacgg 


tcagcatccc 


ggaagagctg 


tacccaccag 


aggagctgca gcggatacat 




tttcacctgc 


tgagagagaa 


tgtgctaarg 


gccgaggaga acccagagtt aacaccagac 


Ron 


ttggacgaaa 


gcacagccct 


gaaaaagccc 


gaagaagatg 


aaaaggacca gctcccgccc 


OQV/ 


cagggagaga 


cagacaagag 


agaagagagg 


ttgctccttc 


tggaaatgaa accaaaagag 




ggaaaagacg 


acgaaattgt 


cc ugaccau L 


tcccatctaa 


gcctcgaaga acagcaagat 




ccaccagcgg 


ccaatcagac 


aagtgtgccg 


ggagccaaag 


ccgcaaaacc aaaacggcgg 


840 


aggcagacca 


agggaaagcc 


tcagagcttt 


cagtgtgaca 


cctgcccgtt cacttcctcc 


900 


aagctctcaa 


ctttcaatcg 


tcacatcaaa 


attcacagca atgagaggcc acacctgtgt 


960 


cacctgtgcc 


tgaaggcctt 


ccggactgtc 


actcttctta ggaaccatgt gaacacccac 


1020 


acaggaacca 


ggccccacaa 


gtgcagggac 


tgcgacatgg 


cgtttgtcac cagcggagaa 


1080 


ctcgtccggc 


acaggcgtta 


caaacacact 


tatgagaagc 


ccttcaagtg ctccctgtgc 


1140 


aagtacgcca 


gcgtcgaggc 


aagcaagatg 


aagcgtcaca tccgctcaca cacgggtgag 


1200 


cgtcccttcc 


agtgttgcca 


gtgtgcttat 


gccagcaggg 


actcctacaa gctgaagcgc 


1260 


cacatgagga 


cacactcagg 


tgagaagccg 


tatgaatgtc 


ccacctgtca cgtccggttc 


1320 


acccagagcg 


ggaccatgaa 


aatccatata 


gcacagaagc 


acggagagaa tgtgcccaaa 


1380 


tacgagtgtc 


cccactgtgc 


caccatcatc 


gcgaggaaga 


gcgacctgcg tgtccatctg 


1440 


cgtaacctgc 


acagccagag 


cccggaggag 


atgaagtgcc gatactgtcc cgctggcttc 


1500 


catgagcgct 


atgccctcat 


tcagcaccag 


aggacccaca 


agaacgagaa gaagttcaag 


1560 
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gagcgatgct 


taaagqcqca 


catgcgcatg 


1620 




CXU Civ&M ^ ^ ^ ^ 


ctcctaccta 


Qcctgcaaca 


agcacttccg 


acagaagcag 


1680 






na a n't arc at: 

u %jl ^ l. a w w w& V 


gacccgaact 


tcgtccccaa 


tctgcacctg 


1740 








"taaaatiaa cc 


tgcagagaca 


cagaaagaag 


1800 


"tTft n^c rend 


aof atria OAf 






aaaQaccaot 


gacaaggaca 


1860 


caggcctcgg 




aggacacaag 


gaaggggagc 


ctcagtgccc 


^^9999^9^^9 




gctctgggcc 


accaaggaga 


agcagcgggg 


agccagagcc 


cagaccacgg 


ccttacctgc 


1980 


gagatgatct 


ttaacatgat 


ggataagtga 


tggataagtg 


agcagtcgtg 


cctctccgtg 


2040 


cagtggcctc 


tgggggaaga 


aaccagttag 


aaataagttc 


ccagacacag 


cacagtgttc 


2100 


tcagagtttg 


agatagtgtg 


tagaaatgtt 


tgagagaagg 


ggaaaaaaac 


cctgcagcta 


2160 


tttccaaaga 


cttgagtcag 


agctcgaagt 


gaaggtgcac 


atatctgggc 


cctagcaggt 


2220 


gcccagaatg 


agtcagggac 


agattctagg 


tgatacttat 


gtccacgggg 


gctcagacca 


2280 


gttaacgcct 


tggtggtcag 


agcagaaaat 


tttttgagtt 


gttgtaccca 


ccctcaa 


2337 



<210> 4 

<211> 636 

<212> PRT 

<213> Mouse 

<400> 4 

Met Ala Ala Ala Glu val Pro val Pro ser Gly Tyr Phe Thr Gin lie 
1 5 10 15 . 

Lys Glu Gin Lys Leu Lys Pro Gly Asp Leu Glu Glu Glu Lys Glu Glu 
20 25 30 

Asp Gly val Gin Arg val Glu Ala Gin Glu Gly val val Lys Glu val 
35 40 45 

Glu Ala Glu Asn Ser Cys Leu Leu Leu Glu Ala Arg Ala Pro val Glu 
50 55 60 

ser Asp Arg Arg He Leu Thr Leu Gin Thr val His Leu Glu Ser Gin 
65 70 75 80 



Asp Val His Leu Gin Gly Leu Gly Trp Leu Ser val Pro His Ser Glu 
85 90 95 



Glu Leu ser Gly Thr val Pro Glu Ala Glu Gly lie Leu Gin Leu Pro 
100 105 110 



• 
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ser val Leu Trp Leu Asp Pro Glu Pro Gin Leu Ser Leu Gin His Cys 
115 120 125 



val Thr val ser lie Pro Glu Glu Leu Tyr Pro Pro Glu Glu Leu Gin 
130 135 140 



Arg lie His Phe His Leu Leu Arg Glu Asn Val Leu Met Ala Glu Glu 
145 150 155 160 



Asn Pro Glu Leu Thr Pro Asp Leu Asp Glu Ser Thr Ala Leu Lys Lys 
165 170 175 



Pro Glu Glu Asp Glu Lys Asp Gin Leu Pro Pro Gin Gly Glu Thr Asp 
180 185 190 



Lys Arg Glu Glu Arg Leu Leu Leu Leu Glu Met Lys Pro Lys Glu Gly 
195 200 205 



Lys Asp Asp Glu lie Val Leu Thr lie Ser His Leu ser Leu Glu Glu 
210 215 220 



Gin Gin Asp Pro Pro Ala Ala Asn Gin Thr ser val Pro Gly Ala Lys 
225 230 235 240 



Ala Ala Lys Pro Lys Arg Arg Arg Gin Thr Lys Gly Lys Pro Gin Ser 
245 250 255 



Phe Gin Cys Asp Thr Cys Pro Phe Thr Ser ser Lys Leu Ser Thr Phe 
260 265 270 



Asn Arg His lie Lys He His Ser Asn Glu Arg Pro His Leu cys His 
275 280 285 



Leu Cys Leu Lys Ala Phe Arg Thr Val Thr Leu Leu Arg Asn His val 
290 295 300 



Asn Thr His Thr Gly Thr Arg Pro His Lys Cys Arg Asp Cys Asp Met 
305 310 315 320 



Ala Phe val Thr ser Gly Glu Leu val Arg His Arg Arg Tyr Lys His 
325 330 335 



Thr Tyr Glu Lys Pro Phe Lys Cys ser Leu Cys Lys Tyr Ala ser val 
340 345 350 



Glu Ala Ser Lys Met Lys Arg His lie Arg ser His Thr Gly Glu Arg 



355 



360 



365 
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Pro Phe Gin cys Cys Gin Cys Ala Tyr Ala Ser Arg Asp Ser Tyr Lys 
370 375 380 



Leu Lys Arg His Met Arg Thr His Ser Gly Glu Lys Pro Tyr Glu cys 
385 390 395 400 



pro Thr cys His val Arg Phe Thr Gin ser Gly Thr Met Lys lie His 
405 410 415 



He Ala Gin Lys His Gly Glu Asn val Pro Lys Tyr Glu Cys Pro His 
420 425 430 



cys Ala Thr He lie Ala Arg Lys ser Asp Leu Arg val His Leu Arg 
435 440 445 



Asn Leu His Ser Gin Ser Pro Glu Glu Met Lys Cys Arg Tyr Cys Pro 
450 455 460 



Ala Gly Phe His Glu Arg Tyr Ala Leu He Gin His Gin Arg Thr His 
465 470 475 480 



Lys Asn Glu Lys Lys Phe Lys Cys Lys Gin cys Asp Tyr Ala Cys. Lys 
485 490 495 



Gin Glu Arg Cys Leu Lys Ala His Met Arg Met His Thr Gly Glu Lys 
500 505 510 



Pro Phe ser cys Leu Ala Cys Asn Lys His Phe Arg Gin Lys Gin Leu 
515 520 525 



Leu Thr val His Leu Arg Lys Tyr His Asp Pro Asn Phe Val Pro Asn 
530 535 540 



Leu His Leu Cys Leu Lys Cys Asp Lys Arg Phe ser Arg Trp Ser Asn 
545 550 555 560 



Leu Gin Arg His Arg Lys Lys Cys Asp Pro Glu His Glu Thr Leu Ala 
565 570 575 



pro Asn Lys Asp Arg Arg Pro val Thr Arg Thr Gin Ala ser Glu Gly 
580 585 590 



Glu Ala Gly His Lys Glu Gly Glu Pro Gin Cys Pro Gly Glu Gin Ala 
595 600 605 



Leu Gly His Gin Gly Glu Ala Ala Gly Ser Gin ser pro Asp His Gly 



610 



615 



620 
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Leu Thr Cys Glu Met lie Phe Asn Met Met Asp Lys 
625 630 635 



<210> 


5 


<211> 


44 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


5 



ctagagcccc tcggccgccc cctcgcggcg cgccctcccc gctt 44 



<210> 6 

<211> 22 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 6 

gagcctgtgg agcgattaaa cc 



<210> 7 

<211> 15 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 



<210> 8 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> synthetic 

<400> 8 

cttctttggc ggcagcggcg 



<210> 9 

<211> 17 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 9 



<400> 7 

ccgccgccgc tccac 



15 
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cgcgccacac cccccgc 17 



<210> 10 

<211> 14 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 10 

ccccagaacc agac 14 

<210> 11 

<211> 17 

<212> DNA 

<213> Artificial 

<220> 

<223> synthetic 

<400> 11 

acttcagtct tcatctg 17 

<210> 12 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> synthetic 

<400> 12 

tgtgagcttt gcagttacac 20 

<210> 13 

<211> 18 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 13 

actgttctga atgccctg 18 

<210> 14 

<211> 16 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 14 

cggcgttcaa atttgg 16 
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<210> 15 
<211> 20 
<212> DNA 
<213> Artificial 

<220> 

<223> Synthetic 
<400> 15 

cgagtacctg tgtgtgtgtt 20 

<210> 16 

<211> 15 

<212> DNA 

<213> Artificial 

<220> 

<223> synthetic 
<400> 16 

gtgcccagac tgcga 15 

<210> 17 

<211> 17 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 17 

aatcgcacat ggaacac 17 

<210> 18 

<211> 17 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 18 

ttcaagtgtt ccatgtg 17 

<210> 19 

<211> 18 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 19 

ctgctggcat aactgcac 18 



<210> 20 
<211> 19 
<212> DNA 
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<213> Artificial 

<220> 

<223> Synthetic 
<400> 20 

cacatacaag ctgaaaagg 19 



<210> 21 

<211> 18 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 21 

gcatcttcat ggtaccac . 18 



<210> 22 

<211> 19 

<212> DNA 

<213> Artificial 

<220> 

<223> synthetic 



<210> 23 

<211> 17 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 23 

cgctcatgaa acacagc 17 



<210> 24 

<211> 18 

<212> ONA 

<213> Artificial 

<220> 

<223> Synthetic 



<400> 22 

gtcatagccc gaaaaagtg 



19 



<400> 24 

gtgtgaccag tgtgatta 



18 



<210> 25 
<211> 17 
<212> DNA 



<213> Artificial 



<220> 
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<223> Synthetic 

<400> 25 

ttctggcgga aggtctt 17 

<210> 26 

<211> 17 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 26 

caagcgctat cacgacc 17 

<210> 27 

<211> 17 

<212> DNA 

<213> Artificial 

<220> 

<223> synthetic 
<400> 27 

tctgcatgtc ttgccat 17 

<210> 28 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 28 

tcctctgaca gtgaaaatgc 20 

<210> 29 

<211> 18 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 29 

cacaggctga ggctctgg 18 

<210> 30 

<211> 19 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 30 
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19 



cagaatacag gtgcaattg 



<210> 31 

<211> 18 

<212> DNA 

<213> Artificial 

<220> 

<223> synthetic 

<400> 31 

caccggtcca tcatgctg 18 



<210> 32 

<211> 27 

<212> ONA 

<213> Artificial 

<220> 

<223> Synthetic 



<210> 33 

<211> 29 

<212> DNA 

<213> Artificial - ~ 

<220> 

<223> Synthetic 

<400> 33 

cactggcaac actgaaaggg gcgctcccc 29 



<210> 34 

<211> 22 

<212> DNA 

<213> Artificial 
<220> 

<223> Synthetic 



<210> 35 

<211> 19 

<212> DNA 

<213> Artificial 
<220> 

<223> Synthetic 

<400> 35 

gacgagttct ccactggtg 19 



<400> 32 

gccagtgtgg aggcaagtaa attgaag 



27 



<400> 34 

tcgtcatatg aaaactcaca cc 



22 
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<210> 36 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 36 



<210> 37 

<211> 18 

<212> DNA 

<213> Artificial 

<220> 

<223> synthetic 

<400> 37 

gagtgcgttc tcatgtgg 18 



<210> 38 

<211> 18 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 



<210> 39 

<211> 15 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 39 

gcacaatggg gacac 15 



<210> 40 

<211> 21 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 



aacatactca tgagaaaccc 



20 



<400> 38 

gagcgcccct ttcagtgt 



18 



<400> 40 

acccagagcg ggaccatgaa a 



21 



<210> 41 
<211> 19 
<212> DNA 
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<213> Artificial 

<220> 

<223> synthetic 
<400> 41 



<210> 42 

<211> 18 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 42 

cataagaatg agaagagg 18 



<210> 43 

<211> 19 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 



<210> 44 

<211> 21 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 44 

ttgtgcagtt atgccagcag g 21 

<210> 45 

<211> 22 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 



gacagcagaa cagtagcgg 



19 



<400> 43 

aagttgcttc tgtcggaaa 



19 



<400> 45 

gtgcttctgt aaaatgtgca tc 



22 



<210> 46 
<211> 27 

<212> DNA 



<213> Artifi 



cial 



<220> 
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<223> Synthetic 

<400> 46 



<210> 47 

<211> 27 

<212> DNA 

<213> Artificial 

<220> 

<223> synthetic 

<400> 47 

gcattcgtaa ggcttctcac ctgagtg 27 



<210> 48 

<211> 32 

<212> DNA 

<213> Artificial 

<220> 

<223> synthetic 



<210> 49 

<211> 32 

<212> DNA 

<213> Artificial 

<220> 

<223> synthetic 

<400> 49 

cctgtgtggg tgttcacatg gttcctaaga ag 32 



caggccctac aagtgtaacg actgcaa 



27 



<400> 48 

gagagacaga caagagagaa gagaggttgc tc 



32 
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